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ALTERATION OF AMINO ACID COMPOSITIONS IN SEEDS 



BACKGROUND OF THE INVENTION 

Feed formulations based on crop plants must typically 
be supplemented with specific amino acids to provide animals 
with essential nutrients which are critical to their growth 
This supplementation is necessary because, in general, crop 
Plants contain low proportions of several amino acids which 
are essential for, and cannot be synthesized by, monogastric 
animals . 

The seeds of crop plants contain different classes of 
seed proteins. The amino acid composition of these seeds 
reflects the composition of the prevalent classes of 
proteins. Amino acid limitations are usually due to amino 
acid deficiencies of these prevalent protein classes. 

Among the amino acids necessary for animal nutrition, 
those that are of limited availability in crop plants 
include methionine, lysine and cysteine. For example, in 
soybean, the 7s globulin accounts for about 30% of the seed 
proteins but contains only 0.3% of methionine, whereas the 
Bowman-Birk inhibitor rBBI", accounts for approximately 1% 
of seed proteins but contains approximately 20% sulfur 
containing amino acids. Attempts to increase the levels of 
these amino acids by breeding, mutant selection, and/or 
changing the composition of the storage proteins accumulated 
in the seeds of crop plants, have met with limited success, 
or were accompanied by a loss in yield. 

For example, although seeds of corn plants containing a 
mutant transcription factor, (opaque 2), or a mutant cc-zein 
gene, (floury 2), exhibit elevated levels of total and bound 
lysine, there is an altered seed endosperm structure which 
as more susceptible to damage and pests. Significant yield 
losses are also typical. 
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An alternative means to enhance levels of free amino 
acids in a crop plant is the modification of amino acid 
biosynthesis in the plant. The introduction of a feedback- 
regulation-insensitive dihydrodipicolinic acid synthase 
PDHDPS") gene, which encodes an enzyme that catalyzes the 
first reaction unique to the lysine biosynthetic pathway, 
into plants has resulted in an increase in the levels of 
free lysine in the leaves and seeds of those plants. 
However, these increases are insufficient to significantly 
increase the total amino acid content of the seed because 
the level of free amino acid in seeds is, in general, only a 
minor fraction of the total amino acid content. 

The expression of the lysC gene, which encodes a mutant 
bacterial aspartate kinase that is desensitized to feedback 
inhibition by lysine and threonine, from a seed-specific 
promoter in tobacco plants, has resulted in an increase in 
methionine and threonine biosynthesis in the seeds of those 
plants. See Karchi, et al.; The Plant J. ; Vol. 3; p. 721; 
(1993); incorporated herein in its entirety by reference. 
However, expression of the lysC gene results in only a 6-7% 
increase in the level of total threonine or methionine in 
the seed. Thus, the expression of the lysC gene in seeds 
has a minimal impact on the nutritional value of those seeds 
and, thus, supplementation of feed containing lysc 
transgenic seeds with amino acids, such as methionine and 
threonine, is still required* 

There are additional molecular genetic strategies 
available for enhancing the amino acid quality of plant 
proteins. Each involves molecular manipulation of plant 
genes and the generation of transgenic plants. 

Protein sequence modification involves the 
identification of a gene encoding a major protein, 
preferably a storage protein, as the target for modification 
to contain more codons of essential amino acids. A critical 
task of this approach is to be able to select a region of 
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the protein that can be Modified without affecting the 
overall structure, stability, function, and other cellular 
and nutritional properties of the protein. The variable 
regionls, i n . polypeptide, as identified throu 9 h sequence 
> analysxs and comparison of related protein species, offer 
possible target sites for such modifications. 

These studies indicate both that it is feasible to 
increase the essential amino acid residues in a seed protein 
by sequence modifications, and that it is important to 
10 select suitable target sites. «¥>ortant to 

The development of DNA synthesis technology allows the 
design and synthesis of a gene encoding a new protein with 
des lrab le essential amino acid compositions. F or examp e 
researchers have synthesized a 2 M -base pair DNA sequence 
» -coding a polypeptide composed o, 80* essential aminolils 
and used it with the nopaline synthetase ( NOS, promoter to 

tuber of transgenic potato has resulted in an accumulation 
of this protein at a level of 0.02* to 0.35* of the total 
Plant prote Th is low level accumulation is possibly du 
proteL Pr0n ° ter Md/0r ^ of the new 

A protein of minor quantity in a plant may contain 
elevated levels of an essential amino acid that is limul 
By ^ enhancing the expression of the gene encoding this 
protein, it may be possible to increase the concentration of 

etLiat ' "* thUS ~ ° f «*• ~ 

essentxal amino aciri T „ . 

acid - In thl s connection, a 10 8-kD 
putative methionine-rich Drofpi - n K 

" protein has recently been 
considered in soybean seeds = ^ 

< eeas as a good candidate for 

^proving the protein quality of soybeans. 

Additionally, recombinant DNA and plant transformation 
technics permit the transfer of geneS between diverse 

12 : riTT; • Thus ' a 9ene encodin9 an — - 

acid rich protein isolated from a specific plant can be 
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introduced into other plants to enhance their protein 
quality. Several plant proteins containing unusually high 
levels of the essential sulfur amino acids and their genes 
have been identified and isolated. They are prime 
candidates for use in protein improvement. 

Tobacco has been used as a test plant to demonstrate 
the feasibility of this approach by transferring a chimeric 
gene containing the bean phaseolin promoter and the cDNA of 
a sulfur-rich protein Brazil Nut Protein ("BNP") , (18 mol% 
methionine and 8 mol% cysteine) into tobacco. Amino acid 
analysis indicates that the methionine content in the 
transgenic seeds is enhanced by 30% over that of the 
untransformed seeds. This same chimeric gene has also been 
transferred into a commercial crop, canola, and similar 
levels of enhancement were achieved. 

However, an adverse effect is that lysine content 
decreases. Additionally, BNP has been identified as a major 
food allergen. Thus it is neither practical nor desirable 
to use BNP to enhance the nutritional value of crop plants. 

This finding marks an area that needs further research. 
It is also useful to point out here that there are 
advantages and disadvantages to each of these approaches. 
While the protein sequence modification and the synthetic 
gene strategies have the flexibility of engineering and 
designing a gene with desirable essential amino acid 
composition, they suffer from the possibility of generating 
unknown structural and biological properties in the protein 
product. Both the heterologous and homologous gene 
approaches enjoy the advantage of utilizing naturally- 
occurring genes. However, the identification of a gene 
encoding a protein rich in a particular essential amino 
acid, if it indeed exists, could be a formidable task. 

There is therefore a need to change the ratio of 
protein classes, without detrimental side effects. 
Endogenous proteins are well adapted for intracellular 
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assembly, targeting and processing. Additionally, a change 
of the protein composition reduces the possibility of 
generating unknown risks for human or animal health because 
all protein compounds are already present in the plant prior 
to modification. However, some endogenous proteins, such as 
BBI, which are rich in essential amino acids, are anti- 
nutritional proteins. 

Based on the foregoing, there exists a need to identify 
endogenous seed storage proteins with increased amounts of 
essential amino acids, which are present in relatively low 
amounts in unmodified seeds, to enhance the nutritional 
value of seeds by genetically modifying the seeds so as to 
over-express genes encoding these proteins. The genetic 
modification should not be accompanied by detrimental side 
effects such as allergenicity, anti-nutritional quality or 
poor yield. 

It is therefore an object of the present invention to 
provide methods for increasing the nutritional content of 
feed. 

It is a further object of the present invention to 
provide methods for genetically modifying seeds so as to 
increase amounts of essential amino acids present in 
relatively low amounts in unmodified seeds. 

It is a further object of the present invention to 
provide methods for introducing endogenous proteins into 
seeds . 

It is a further object of the present invention to 
provide methods for increasing the nutritional content of 
seeds without detrimental side effects such as 
allergenicity, poor yield or anti-nutritional quality. 

SOMMARY OF THE INVENTION 

The methods of the present invention comprise the 
transformation of plant cells by introducing an expression 
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cassette comprising a preselected DMA segment encoding a 
seed storage protein. 

The present invention also provides a fertile 
transgenic soybean plant containing an isolated preselected 
DMA segment comprising a promoter and encoding a seed 
storage protein comprising preselected amino acids under the 
control of the promoter. 

The present invention also provides an isolated and 
purified DNA molecule comprising a preselected DNA segment 
encoding a soybean seed storage protein. 

The present invention also provides an antibody capable 
of specifically binding soybean albumin. 

The present invention also provides methods of 
isolating albumins from seeds. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 depicts the amino-terminal sequences of 
albumin 1, albumin 2, and albumin 3, as determined by Edman 
degradation of proteins isolated from PVDF blots. 

Figure 2 depicts the cDNA sequence (SEQ ID NO: 1) of 
albumin 1 isolated from a soybean seed cDNA library, and the 
corresponding predicted amino acid sequence of albumin 1 
(SEQ ID NO: 2) . 

Figure 3 depicts the cDNA sequence (SEQ ID NO: 3) of 
albumin 3 isolated from a soybean seed cDNA library, and the 
corresponding predicted amino acid sequence of albumin 3 
(SEQ ID NO: 4) . 

Figure 4 depicts the cDNA sequence (SEQ ID NO: 5) and 
the amino acid sequence (SEQ ID NO: 6) of a chimeric albumin 
which comprises sequences from albumin 1 and albumin 3 

Figure 5 termed albumin 1/3 depicts a comparison of the 
amino acid sequences of albumin 1, albumin 3 and albumin 
1/3. 

Figure 6 depicts a plasmid map of p4752. 
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DETAILED DESCRIPTIfM n p THE IMVEMTTHm 

The present invention provides a method for genetically 
modxfyxng seeds to increase the level of at least one 
preselected amino acid in the seed so as to enhance the 
5 nutrxtxonal value of the seeds. The methods comprise the 
introductxon of an expression cassette into referable 
Plant cells to yield transformed pi ant cells. The 
expression cassette comprises a preselected DNA segment 
encode a soybean seed ^ ^ 

> Preselected amino acids, operably lin.ed to a promoter 
functional in plant cells. Promoter 

A fertile transgenic plant is regenerated from the 

hT S seT d CGllS ' an<1 S6edS ^ i5 ° lated fr ° m the 
The seeds comprxse the protein which is encoded by the 

ZTfllTetl\ Dm se9ment and which is produced in - — * 

acid ^ to xncrease the amount of the preselected amino 
add xn the seeds of the transformed plants, relative to the 
amount of the preselected amino acid in the seeds of a 
corresponding untransformed plant, e.g., the seeds of a 
regenerated control plant that is not transformed or 
corresponding untransformed seeds isolated from the 
transformed plant. 

oref P " feraMy ' Che I— «l~t1 amino acid is lysine. More 
preferably, there is an additional preselected ^ 

Even .ore preferably, the additional preselected amino acid 
cysteine or methionine. 

i„t h P r efe " ed •*° dta - rt ° f the I«— t invention is the 
introduce of an expression cassette into regenerate 
soybean cells. Also preferred is the introduction of an 
expres Slon caS sette comprising a preselected DNA segment 
encoding an endogenous polypeptide sequence. 

The present invention encompasses segments having 
S uff lcl ent similarity to the segments disclosed hereina 1 r 
Generally, such sufficient similarity should comprise a" 
least about 60S identity or 60, homology between base pai 
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10 through 474 in albumin 1 (SEQ ID NO: 1), between base 
pairs 28 through 501 in albumin 3 (SEQ ID NO: 3) and between 
base pairs 28 and 501 in albumin 1/3 (SEQ ID NO: 5) . 
Preferably, such sufficient similarity should comprise at 
5 least about 70% identity or 70% homology. More preferably, 
such sufficient similarity should comprise at least about 
80% identity or 80% homology. Even more preferably, such 
sufficient similarity should comprise at least about 90% 
identity or 90% homology. Most preferably, the segments of 

la the present invention are of the sequences disclosed in SEQ 
ID NO: 1, SEQ ID NO: 3 and SEQ ID NO: 5 respectively. 

The present invention also encompasses variations in 
the sequences described above, wherein such variations are 
due to site-directed mutagenesis, or other mechanisms known 

15 in the art, to increase or decrease levels of selected amino 
acids of interest. For example, site-directed mutagenesis 
to increase levels of lysine, methionine and/or cysteine, 
and/or to decrease levels of asparagine and/or glutamine is 
a preferred embodiment • 

20 The present invention also provides a fertile 

transgenic plant. The fertile transgenic plant contains an 
isolated preselected DNA segment comprising a promoter and 
encoding a seed storage protein comprising preselected amino 
acids under the control of the promoter. The DNA segment is 

25 expressed as the seed storage protein so that the level of 
preselected seed storage protein amino acids in the seeds of 
the transgenic plant is increased above the level in the 
seeds of a plant which only differ from the seeds of the 
transgenic plant in that the DNA segment or the encoded seed 

30 protein is under the control of a different promoter. The 
DNA segment is transmitted through a complete normal sexual 
cycle of the transgenic plant to the next generation. 

Also provided is an isolated and purified DNA molecule 
comprising a preselected DNA segment encoding a soybean seed 

35 storage protein. A most preferred embodiment of the 
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invention is a preselected DMA segment encoding a soybean 
albumin. See e.g. Shewry, et al.; The Plant Cell : Vol. 7; 
No. 7; pp. 945-956; (1995); incorporated herein in its 
entirety by reference. 

The present invention also provides an expression 
cassette comprising a preselected DNA segment encoding a 
soybean seed storage protein, operably linked to a promoter 
functional in a host cell. Preferred promoters useful in 
the practice of the invention are those seed-specific 
promoters that allow expression of the preselected DNA 
segment selectively in seeds to avoid any potential 
deleterious effects associated with the expression of the 
preselected DNA segment in non-seed organs. 

Other embodiments of the invention include plants, 
Plant parts, seeds and microorganisms transformed with the 
preselected DNA segment encoding a seed storage protein 
Preferably, the seed storage protein is an albumin. More 
preferably, the seed storage protein is a soybean albumin. 

Other embodiments of the present invention also include 
a chimera with increased levels of preselected amino acids. 

in a preferred embodiment of the present invention, a 
method is provided for the simple, rapid, and reliable 
production of transgenic soybean P l ants with increased 
accumulation of lysine, in the seeds produced thereby m a 
25 more preferred embodiment, increased accumulation of 
methionine and/or cysteine occurs in addition to increased 
accumulation of lysine. The method is genotype-independent 
and shows a substantial, unexpected improvement over 
previously used systems. 

30 The present invention also provides methods for 

isolating and purifying 2S albumins comprising the 
separation of albumins from contaminants by specifically 
interacting the albumins with the matrix of a carbohydrate 
resin, preferably a dextran resin, even more preferably 

35 Sephadex G25. The above methods for isolation and 
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purification are unexpected given the molecular sieve 
characteristics of the resin. The sp cific interaction 
between the albumins and the matrix has applications useful 
for batch processes . 

As used herein, a ''preselected DNA segment" means an 
exogenous or recombinant DNA sequence or segment that 
encodes a soybean seed storage protein, wherein the seed 
storage protein is preferably not a functional protease 
inhibitor, not a functional a amylase inhibitor and not a 
lectin. 

A preferred seed storage protein of the invention is 
one that has an increased content of lysine as well as 
sulfur containing amino acids, i.e., methionine and/or 
cysteine. The choice of the preselected DNA segment and 
amino acid is based on the amino acid composition of the 
protein encoded by the preselected DNA segment, and the 
ability of the protein to accumulate in seeds. Moreover, 
the amino acid composition of the protein can be manipulated 
by methods, such as site-directed mutagenesis of the 
preselected DNA segment encoding the protein, so as to 
result in expression of a protein that is increased in the 
amount, i.e., content, of a particular amino acid. A 
preferred embodiment of the invention is a preselected DNA 
segment encoding a soybean seed storage protein that has an 
elevated amount of lysine, and methionine and/or cysteine, 
such as a preselected DNA segment encoding a soybean 
albumin. Because an endogenous protein is utilized, the 
possibility of generating unknown risks for human and/or 
animal health is reduced. 

As used herein, the term "high lysine content protein" 
means that the protein has at least about 7% lysine, more 
preferably at least about 10% lysine, even more preferably 
at least about 12% lysine, and most preferably at least 
about 13% lysine. In a preferred embodiment, the high 
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lysine content protein is also a high sulfur content 
protein . 

As used herein, the term "high sulfur content protein" 
means that the protein contains methionine and/or cysteine 
in addition to lysine, at levels indicated hereinafter. The 
high sulfur content protein has at least about 6% methionine 
and/or cysteine, preferably at least about 9% methionine 
and/or cysteine, and more preferably at least about 11% 
methionine and/or cysteine. 

As used herein, "increased" or "elevated" levels or 
amounts of preselected amino acids in a transformed plant 
are levels which are greater than the levels or amounts in 
the corresponding untransformed plant. For example, the 
average methionine content in soybean seed proteins is about 
1.4%, the average cysteine content in soybean seed proteins 
is about 1.4%, and the average lysine content in soybean 
seed proteins is about 6.0% (George, et al.; J. Aqric. Food 
Chenu; Vol. 34; p. 224; (1991, ; incorporated herein in its 
entirety by reference). Thus, the expression of soybean 
albumin 1 having SEQ ID NO: 2, which has about 12% of a 
combination of methionine and cysteine and about 10% lysine 
xn seeds results in an increase in the level or amount of 
methionine, cysteine and lysine in those seeds 
Furthermore, the expression of soybean albumin 3, having SEQ 
ID NO: 4, which has about 12% of a combination of methionine 
and cysteine and about 10% l ys i ne , in seeds results in an 
increase in the level or amount of methionine, cysteine and 
lysxne in those seeds. The amino acid composition of a 
protem can be determined by methods well known to the art 

increased amounts of preselected amino acids other than 
lysxne m a transformed plant are preferably at least about 
15 to 30%, preferably at least about 30 to 50%, and most 
preferably about 50 to 100%, greater than the amounts of the 
preselected amino acid in a non- trans formed plant 
increased amounts of preselected lysine in a transformed 
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plant are preferably at least about 5-10%, more preferably 
at least about 10-15%, even more preferably at least about 
15-25%, most preferably at least about 25-50% greater than 
the amounts of lysine in a non- trans formed plant. 

As used herein, "genetically modified plant" means a 
plant which comprises a preselected DNA segment which is 
introduced into the genome of the plant by transformation. 
The term "wild type" refers to an untransformed plant i.e., 
one where the genome has not been altered by the 
introduction of the preselected DNA segment. 

As used herein, "plant" includes but is not limited to 
plant cells, plant tissue and plant seeds. For the present 
invention, preferred plants include soybean, canola, 
sunflower, sorghum and corn. More preferred plants include 
soybean and corn. The most preferred plant is soybean. 

As used herein with respect to a preselected DNA 
segment encoding a protein, the term "expresses" means that 
the preselected DNA segment is incorporated into the genome 
of the cells, so that the product encoded by the preselected 
DNA segment, e.g., a sulfur-rich protein such as albumin, is 
produced within the cells. For example, novel plants 
resulting from expression of a preselected DNA segment 
encoding an albumin contain extractable levels of the 
albumin of at least about 3%, preferably at least about 5%, 
more preferably at least about 10%, and even more preferably 
at least about 20%, of the total protein in the seed. 

The class of plants which can be used in the method of 
the invention is generally as broad as the class of seed- 
bearing higher plants amenable to transformation techniques, 
including both monocotyledonous and dicotyledonous plants. 
Seeds derived from plants regenerated from transformed plant 
cells, plant parts or plant tissues, or progeny derived from 
the regenerated transformed plants, may be used directly as 
feed or food, or further processing may occur. in the 
practice of the present invention, the most preferred plant 
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seed x. selected from that of soybean, canola, sunflower, 
sorghu* and corn. More preferably, the plant seed is that 
of corn or soybean, most preferably that of the soybean 
ma*. Ihe transformation of the plants in accordance 
with the xnvention may be carried out in essentially any of 
the various ways known to those skilled in the art of plant 
molecular biology. These include, but are not limited to 
mrcropro.ectile bombardment, microinjection, electroporation 
of protoplasts or cells comprising partial cell walls, Z 
10 Affrobacterrum-mediated DNA transfer. 

As used herein, "recombinant" DNA is a DNA sequence or 

zzr has been isoiated f ™ • " 

B isolatlT d herei "' <>iSOlatad " ~ »« =«her physically 

of he 8 ° r '» -tro on the basis 

of the sequence of an isolated DNA segment 

genes'^ T here1 "' "™* * Seed protei '> *— 

and homologous to the 2S albumin seed protein family. see 

ZZJT* ' incorporated h — in - * 

As used herein, « 2S soybean albumin" means a GWcine 
seed protein whose genes encode peptide precursors whrcfa^ 
homologs of the albumins. 

" »rot2\r a ° D \ in ™ ti0n Pr ° VldeS f ° r the of a 

protexn of preseiected amino acid composition in a seed at 

ratfe r to reduc ° ° r ° b * iate f ~ d 

~ Pr0tei "' WhtCh " ^ a preseiected DNA 

segment of the invention, is a seed storage proteif 
» Because seed storage proteins normally accumulate Tn seed' 
overexpression of these proteins in seed will not have t 
overcome incompatibility with the assembly, targeting and 
processing mechanisms in the ceU. In addition t e e t 

reasons in comparrson with wild type seeds. A preferred 
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embodiment of the invention includes a seed storage protein 
rich in lysine as well as sulfur-containing amino acids. One 
example of such a protein is an albumin. To enhance 
expression of a protein of preselected amino acid 
composition in a seed at a level to increase the level of 
the preselected amino acid in the seed, expression cassettes 
with seed-specific promoters can be employed. 

I- DNA USED FOR TRANSFORMATION 
DNA-encoding seed storage protein (s) useful for 
introduction into plant cells includes DNA that has been 
derived or isolated from any source, that may be 
subsequently characterized as to structure, size and/or 
function, chemically altered, and later introduced into the 
plant. An example of DNA "derived" from a source, would be 
a DNA sequence or segment that is identified as a useful 
fragment within a given organism, and which is then 
synthesized in essentially pure form. An example of such 
DNA "isolated" from a source would be a useful DNA sequence 
that is excised or removed from the source by chemical 
means, e.g., by the use of restriction endonucl eases, so 
that it can be further manipulated, e.g., amplified, for use 
in the invention, by the methodology of genetic engineering. 

Therefore, useful DNA includes completely synthetic 
DNA, semi-synthetic DNA, DNA isolated from biological 
sources, and DNA derived from RNA. The DNA isolated from 
biological sources, or DNA derived from RNA, includes, but 
is not limited to, DNA or RNA from plant genes, and non- 
plant genes such as those from bacteria, yeasts, animals or 
viruses. The DNA or RNA can include modified genes, 
portions of genes, or chimeric genes, including genes form 
the same or different genotype. The term "chimeric gene" or 
"chimeric DNA" is defined as a gene or DNA sequence or 
segment comprising at least two DNA sequences or segments 
from species which do not recombine DNA under natural 
conditions, or which DNA sequences or segments are 
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positioned or linked in . manner „ hicn does ^ ^ 
occur i„ the native genome of untransformed plant. Thus, it 
« «ithi„ the scope of the invention to isolate a 
preselected DHA segment from , given soybean genotype, a.d 
to subsequently introduce at least one copy of the 
preselected DMA segment into the same genotype 

idenA T k l6Cted "* Se9ment ° f "» inTCntion «» * 
or prol b \ Stam " a " ethods ' -"«"-«t protocols, 

or probes, directed to the isolation of particular 
» nucleotide or amino acid sequences ti.. Particular 

u sequences. The preselected Dm 
segment can be identified by obtaining and/or screening of a 
OKA or cDKA library generated from nucleic acid derived fLm 
a particular cell type, cell line, primary cells, or tissue 
screening for » fragments that encode all or a portion o, 
the preselected DNA segment can be accomplished by^creening 

a probe of the preselected DNA segment fro. other organisms 
or by screening plagues fro. a cD«A expression library for 
bind.ng to antibodies that specifically recognise the 
protein encoded by the preselected DNA segment. dna 

i hybridi2e to a presei ~ ted » * J? 

from other organisms and/or plaques carrying DNA fragffients 
that are immunoreactive with antibodies to the protein 
encoded by tbe preselected » segment can be subclonL n o 
a vector and sequenced and/or used as probes to identify 
other cDNA or genomic sequences encoding all or , port 2 or 
the preselected DMA segment. 

Portions of the genomic copy or copies of the 

ZZ TT, CM te P "" a11 ^ "quenced and 

w " Standard neth ° dS inClUdi " 9 « ~e 

homology to other homologous genes or by comparisol 
encoded ammo acid sequences tQ ^ ^ 

Once portions of the preselected DNA segment are ide« f Z' 
complete copies of the preselected Dm segment c be 
Obtained by standard methods, including cloning £ 
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polymerase chain reaction (PGR) synthesis using 
oligonucleotide primers complementary to the preselected DNA 
segment. The presence of an isolated full-length copy of 
the preselected DNA segment can be verified by comparison of 
its deduced amino acid sequence with the amino acid sequence 
of native polypeptide sequences. 

The preselected DNA segment encoding the seed storage 
protein can be modified to increase the content of 
particular amino acid residues in that protein by methods 
well known to the art, including, but not limited to, site- 
directed mutagenesis. Thus, derivatives of naturally 
occurring proteins can be made by nucleotide substitution of 
the preselected DNA segment encoding that protein so as to 
result in a protein having a different amino acid at the 
position in the protein which corresponds to the codon with 
the nucleotide substitution. The introduction of multiple 
amino acid changes in a protein can result in a protein 
which is significantly enriched in a preselected amino acid. 

The present invention thus provides a DNA molecule 
comprising a preselected DNA segment encoding a seed storage 
protein. The preselected DNA segment can encode any seed 
storage protein including, but not limited to, the 2S, 7S 
and lis seed storage proteins, with or without modification 
of the sequence encoding those proteins. The skilled 
artisan will recognize that the choice of the protein 
encoded by the preselected DNA segment will be based on the 
amino acid composition of the protein and its ability to 
accumulate in seeds. The amino acid can be chosen for its 
nutritional value to produce a value-added trait to the 
plant or plant part. Amino acids desirable for value-added 
traits, as well as a source to limit synthesis of an 
endogenous protein include, but are not limited to, 
methionine, cysteine, and lysine. 

Also provided are methods for increasing the level of 
at least one preselected amino acid in seeds by expressing a 
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preselected DNA segment encoding a protein in seeds. 
Preferably, the preselected amino acid is lysine. More 
preferably, a second preselected amino acid is also included 
in the present invention. Even more preferably, the second 
preselected amino acid is methionine or cysteine. 
Expression of the preselected DNA segment, or multiple 
copies of the preselected DNA segment, can increase the 
level of the protein encoded by the preselected DNA segment 
in the seeds and, thus, the level of the preselected amino 
acid which has been incorporated into the protein encoded by 
the preselected DNA segment. Methods and compositions are 
provided for producing plant cultures, plant tissues, plants 
and seeds that comprise an expression cassette comprising a 
preselected DNA segment encoding a protein. The present 
invention provides a method of genetically engineering 
plants so that the plants produce seeds with increased 
levels of at least one preselected amino acid, such that 
plants and seeds can sexually transmit this trait to their 
progeny. 

In a preferred embodiment, the protein encoded by the 
preselected DNA segment is a sulfur rich 2S seed storage 
protein, such as albumin, m a more preferred embodiment of 
the invention, the preselected DNA segment encodes an 
endogenous 2S soybean albumin. By way of example, and not 
limitation, those skilled in the art will readily appreciate 
that the 2S albumin gene from other organisms may be 
substituted for the soybean 2S albumin protein. See, for 
example, Coulter, et al.; J. Exp. Bot. .- Vol. 41; p. 1541- 
(1990); incorporated herein in its entirety by reference. 

Other examples of sulfur-rich plant proteins within the 
scope of the invention include plant proteins enriched in 
cysteine but not methionine, such as the wheat endosperm 
purothionine (Max and Jones; Can. J. Biochp™ .. vol. 22; p 
83J; (1976); incorporated herein in its entirety' by 
reference), and the pea l ow molecular weight albumins 
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(Higgins, et al.; J. Biol. Chem. ; Vol. 261; p. 11124; 
(1986); incorporated herein in its entirety by reference). 
Such proteins also include methionine-rich plant proteins 
such as from sunflower seed (Lilley, et al.; In: Proceedings 
5 of the World Congress on Vegetable Protein Utilization in 
Human Foods and Animal Feedstuff s ; Applewhite, H. (ed.); 
American Oil Chemists Soc; Champaign, IL; pp. 497-502; 
(1989); incorporated herein in its entirety by reference), 
corn (Pedersen, et al.; J. Biol. Chem. p. 261; p. 6279; 

10 (1986); Kirihara, et al.; Gene , Vol. 71; p. 359; (1988); 
both incorporated herein in its entirety by reference), and 
rice (Musumura, et al.; Plant Mol. Biol. ; Vol. 12; p. 123; 
(1989); incorporated herein in its entirety by reference). 
Expression Cassettes and Expression Vectors 

15 According to the present invention, a preselected DNA 

segment encoding a protein, such as a seed storage protein, 
is identified, isolated, and combined with at least a 
promoter functional in a host cell, e.g., a plant cell, to 
provide a recombinant expression cassette. The construction 

20 of such expression cassettes which may be employed in 
conjunction with the present invention are well known to 
those of skill in the art in light of the present 
disclosure- See, e.g., Sambrook, et al.; Molecular Cloning: 
A Laboratory Manual ; Cold Spring Harbor, New York; (1989); 

25 Gelvin, et al.; Plant Molecular Biology Manual ; (1990) ; 
Plant Biotechnology: Commercial Prospects and Problems , eds 
Prakash, et al.; Oxford & IBH Publishing Co.; New Delhi, 
India; (1993); and Heslot, et al.; Molecular Biology and 
Genetic Engineering of Yeasts ; CRC Press, Inc., USA; (1992); 

30 each incorporated herein in its entirety by reference. 

Promoters 

Preferred expression cassettes of the invention will 
generally include, but are not limited to, a seed-specific 
promoter. Examples of seed-specific promoters include 
35 promoters of seed storage proteins which express these 
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proteins in seeds in a highly regulated manner (Thompson, et 
al.; BioEssays ; Vol. 10; p. 108; (1989); incorporated herein 
in its entirety by reference), such as, for dicotyledonous 
plants, a bean 0-phaseolin promoter, a napin promoter, a 0- 
5 conglycinin promoter, and a soybean lectin promoter. For 
monocotyledonous plants, promoters useful in the practice of 
the invention include, but are not limited to, a maize 15 kD 
zein promoter, a 22 JcD zein promoter, a y - 2e in promoter, a 
waxy promoter, a shrunken 1 promoter, a globulin 1 promoter, 
10 and the shrunken 2 promoter. However, other promoters 
useful in the practice of the invention are known to those 
of skill in the art. 

II- DELIVERy OF DMA. Tf) rnsT.T.c 
The expression cassette or vector can be introduced 
into prokaryotic or eukaryotic cells by currently available 
methods. For example, the expression cassette or vector can 
be introduced into plant cells by methods including, but not 
limited to, Agrobacteri um-mediated transformation, 
electroporation, microprojectile bombardment, 

microinjection,- infectious viruses or viroids, the use of 
liposomes and the like, all in accordance with well-known 
procedures. Plant cells useful for transformation include 
cells cultured in suspension cultures, callus, embryos, 
meristem tissue, pollen, and the like. Transformed cells 
can be selected typically using a selectable or screenable 
marker encoded on the expression vector. 

introduction and expression of foreign genes in 
dicotyledonous plants such as soybean, tobacco, potato and 
alfalfa has been shown to be possible using the T-DNA of the 
tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens. 
Using recombinant DNA techniques and bacterial genetics, a 
wide variety of foreign DNAs can be inserted into T-DNA in 
Agrobacterium. Following infection by the bacterium 
containing the recombinant Ti plasmid, the foreign DNA is 
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inserted into the host of plant chromosomes, thus producing 
a genetically engineered cell and eventually a genetically 
engineered plant. A second approach is to introduce root- 
inducing (Ri) plasmids as the gene vectors. 

While Agrobacterium appear to preferably attack dicots, 
many important crop plants including maize, wheat, rice, 
barley, oats, sorghum, millet, and rye are monocots and are 
not known to be easily susceptible to transformation by 
Agrobacterium. The Ti plasmid, however, may be manipulated 
in the future to act as a vector for monocot plants. 
Additionally, using the Ti plasmid as a model system, it may 
be possible to artificially construct transformation vectors 
for monocot plants. Ti-plasmids might also be introduced 
into monocots by artificial methods such as microinjection, 
or fusion between monocot protoplasts and bacterial 
spheroplasts containing the T-region, which can then be 
integrated into the plant nuclear DNA. Other transformation 
methods are readily available to those skilled in the art. 
HI- REGENERATION AND ANALYSIS OF TRANSFORMANTS 

Following transformation, regeneration is involved to 
obtain a whole plant from transformed cells and the presence 
of preselected DNA segment (s) or *transgene (s) " in the 
regenerating plant detected by assays. The seed derived 
from the plant is then tested for levels of preselected 
amino acids. Depending on the type of plant and the level 
of gene expression, introduction of the preselected DNA 
segment into the plant can enhance the level of preselected 
amino acids in an amount useful to supplement the 
nutritional quality of those seeds. 

Techniques for regenerating plants from tissue culture, 
such as transformed protoplasts or callus cell lines, are 
known in the art. For example, see Phillips, et al.; Plant 
Cell Tissue Organ Culture ; Vol. 1; p. 123; (1981); 
Patterson, et al.; Plant Sci . ; Vol. 42; p. 125; (1985); 
Wright, et al.; Plant Cell Reports ; Vol. 6; p. 83; (1987); 
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and Barwale, et rianta.. Vol . 167; p _ ^ 

mcorporated herein i„ its entlrety by 

flection of an appropriate -ethod is within the skin of 

the art. 

5 Examples of the practice of present invention detailed 

her ei „ relate specifically to soybean plants and expression 
vectors operable in dlC ots. However, the present invention 

ilte°d a h 31516 t0 ° ther PUntS - ^ ~*«. 
■0 "* «pable of operation in 

■0 cells of many dicotyledonous plants both in tissue culture 
and in whole plants. The invention disclMed herein 
operable » dicotyledonous species to transform indivi u" 
Plant cells and to achieve full, intact plants in dicot 
» P n cT eS ^ ^ »~ *«- transformed 

protin: " "* WhlCh "~ — — — -ora,e 

^ th? rnstr: pr ;z e z:r^z ar : ~ 

,, nBaf . a „ F celAs stably transmitted 

20 r ' SeXUaUy) '° "» M " oration of cells 

LLt •" ^ ^ ShOUld be CaPable ° f mtroduc C 
" ln aiM '' — messing a preselected »a segment I 

p^r « s^rrt:: eeii t s o£ piants - - 

25 sexual - • . P ° D tC P ro 9 e *y *>Y normal 

sexual transmission. 

To confirm the presencp ^ +-k 

pj-esence or the preselected nua 
segment(s) or "transgenelsl- i„ selected Ma 

seeds or «»«•<«> "> the regenerating plants, or 

varlt v of y 6d ^ »~t- P^nt. a 

» for ex " e aS « ayS . ^ 

those oTl;,, biological- assays well known to 

those of skill ln the art, such as Southern and Northern 

Mottxn, and PCK; "biochemical- assays, such as detect", 

IL r~ Pr ° teln PIOdUCt ' i—logica! 

means (ELISAs and Western bloto «r x„ 
15 r,i a „4- DJ-otsj or by enzymatic function- 

's plant part assays, such as leaf seed or r - ™ ct "»' 

ar ' seed or root assays; and 
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also, by analyzing the phenotype of the whole regenerated 
plant . 

Whereas DNA analysis techniques may be conducted using 
DNA isolated from any part of a plant, RNA may only be 
expressed in particular cells or tissue types and hence it 
will be necessary to prepare RNA for analysis from these 
tissues. PCR techniques may also be used for detection and 
quantitation of RNA produced from introduced preselected DNA 
segments. In this application of PCR it is first necessary 
to reverse transcribe RNA into DNA, using enzymes such as 
reverse transcriptase, and then through the use of 
conventional PCR techniques amplify the DNA. In most 
instances PCR techniques, while useful, will not demonstrate 
integrity of the RNA product. Further information about the 
nature of the RNA product may be obtained by Northern 
blotting. This technique will demonstrate the presence of 
an RNA species and give information about the integrity of 
that RNA. The presence or absence of an RNA species can 
also be determined using dot or slot blot Northern 
hybridizations. These techniques are modifications of 
Northern blotting and will only demonstrate the presence or 
absence of an RNA species. 

While Southern blotting and PCR may be used to detect 
the preselected DNA segment in question, they do not provide 
information as to whether the preselected DNA segment is 
being expressed. Expression may be evaluated by 

specifically identifying the protein products of the 
introduced preselected DNA segments or evaluating the 
phenotypic changes brought about by their expression. 

Assays for the production and identification of 
specific proteins may make use of physical-chemical, 
structural, functional, or other properties of the proteins. 
Unique physical-chemical or structural properties allow the 
proteins to be separated and identified by electrophoretic 
procedures, such as native or denaturing gel electrophoresis 
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or isoelectric focusing, or by chromatographic techniques 
such as ion exchange or gel exclusion chromatography. The 
unxque structures of individual proteins offer opportunities 
for use of specific antibodies to detect their presence in 
I formats such as an EL ISA assay. Combinations of approaches 
may be employed with even greater specificity such as 
Western blotting in which antibodies are used to locate 
xndxvidual gene products that have been separated by 
electrophoretic techniques. Additional techniques may b e 
employed to absolutely confirm the identity of the product 
of xnterest such as evaluation by amino acid sequencing 
followxng purification. Although these are among the most 
commonly employed, other procedures may be additionally 
used. 

Very frequently, the expression of a gene product is 
determined by evaluating the phenotypic results of its 
expression. These assays also may take many forms, 
including but not limited to, analyzing changes in the 
chemxcal composition, morphology, or physiological 
properties of the plant. chemical composition may be 
altered by expression of preselected DNA segments encoding 
storage proteins which change amino acid composition and may 
be detected by amino acid analysis. 

Breeding techniques useful in the present invention are 
well known in the art. 

The present invention has been described with reference 
to various specific and preferred embodiments and will be 
further described by reference to the following detailed 
examples. it i s understood, however, that there are many 
extensions, variations, and modifications on the basic theme 
of the present invention beyond that shown in the examples 
and description, which are within the spirit and scope of 
the present invention. 
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EXAMPLE 1 

Isolation and Characterization of Soybean 2S Albumins 

Soybean plants ( G. max Merr. ) varieties are grown in 
the greenhouse or in the field. If not otherwise stated, 
reagents and laboratory supplies are obtained from Sigma 
Chemical Co. (St. Louis, MO) or Baxter (McGaw Park, IL) . 
Protein concentrations are estimated either according to 
Bradford (BioRad® protein assay, BioRad®, Hercules, CA) or 
with a modified Lowry assay (DC protein assay, BioRad®) 
with bovine serum albumin (Pierce, Rockford, IL) as a 
standard. 

The present method comprises the steps of; 

a) protein extraction from soybean meal; 

b) size exclusion chromatography of the protein 
15 extract; 

c) collection of albumin-containing fractions; 

d) separation of albumins from other protein 
contaminants by specifically interacting the albumins to the 
matrix of a resin; and 

20 e) ion-exchange chromatography to separate the 

individual albumins. 

SDS polyacrylamide gel electrophoresis ("PAGE") is 
performed using the Tris-Tricine buffer system developed by 
Schagger and von Jagow. See Schagger, H. and von Jagow, G., 
25 Anal. Biochenu Vol. 166, p. 368 (1987); incorporated herein 
in its entirety by reference. For routine purposes 
polypeptides are separated in 16.5% Mini-Protein II precast 
mini-gels (80x73 mm Ready Gels, BioRad®, Richmond, CA) or, 
when a superior resolution of polypeptides in the molecular 
weight range between 2 and 25 kDa is required, in 170x150 mm 
8-22% polyacrylamide gradient gel using a Model V16 
electrophoresis apparatus (GibcoBRL®, Gaithersburg, MD) . 
Protein bands are detected by staining with Coomassie 
brilliant Blue R 250. 
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When indicated, after electrophoretically separating 
the proteins, the separated polypeptides are 
electrotransferred to polyvinylidene difluoride (PVDF) 
membranes (Immobilon PSQ®, Millipore, Bedford, MA) using a 
semi-dry electroblotter (SemiPhor® TE70, Hoefer, San 
Francisco, CA) as described by Matsudaira ( J. Biol. Chem. .. 
Vol. 262; p. 10035; (1987); incorporated herein in its 
entirety by reference) . Several precautions are undertaken 
to prevent blocking of N-terminal amino groups and to 
minimize modifications of amino acid side chains prior to 
amino acid sequencing. The Tris/Tricine gels, including the 
stacking gel, are cast 3-7 days prior to the protein 
separation and stored sealed at 4°C. Immediately before 
separation, gels are pre-run at 2V/cm for 15 hours with 0.1% 
SDS, 0.75M Tris/HCl, pH 8.45 (anode buffer) and 0.1% SDS, 1M 
Tris/HCl, pH 8.45 (cathode buffer). Following 
electrotransfer of polypeptides to PVDF (see above) and 
staining with Coomassie Blue, the blots are washed 
extensively with water and dried. Polypeptide bands of 
interest are carefully excised from the membranes and stored 
in microcentrifuge tubes at 4°C until needed. N-terminal 
sequence is obtained from Immobilon PSQ membranes by using 
an Applied Biosystems 477A Protein sequencer in the Protein 
Analysis Laboratory of the University of Iowa (Iowa City 
25 IA) . 

Amino acid analysis is carried out on a Beckman 6300 
analyzer according to standard procedures. Methionine and 
cysteine are determined as methionine sulfone and cysteic 
acid after performic acid oxidation. Isoelectric focusing 
30 of proteins is performed in pre-cast slab gels (pH 
performance range 3.5-6.5, Novex, San Diego, CA) with Novex 
Low Range IEF protein standards according to the 
manufacturer' s recommendations . 
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To determine whether an isolated protein contains N- 
linked glycans, Concanavalin A-horseradish peroxidase 
staining of protein blots (see above) is carried out as 
described by Faye and Chrispeels ( Anal. Biochem. ; Vol. 149, 
p. 218; (1985); incorporated herein in its entirety by 
reference) with the modification that horseradish peroxidase 
activity is visualized by chemiluminescence (ECL kit, 
Amersham, Arlington Heights, IL) . 

For N-Glycosidase F (Boehringer Mannheim, Indianapolis, 
IN) treatment (0.1 U/10 ul, 15 hours at 37°C) protein 
samples (10 ug/10 ul) are denatured by 2 min. at 95°C in 
0.1% SDS, 200 mM NaCl, 20 mM Tris/HCl, pH 8.5, cooled to 
4°C, supplemented to 1% Triton X-100 and incubated for 15 
min. at room temperature prior to enzyme addition. 

Purificatio n and Characterization of Lvsine-rich 
and Sulfur-rich Soybean 28 Albumins 

Transgenic soybean seed expressing a methionine-rich 2S 
seed storage protein from Brazil Nut ( Bertholletia excel sa ) 
(«BNP' r ) shows a reduction in the levels of the sulfur-rich 
endogenous Bowman-Birk inhibitor (Kollipara, K.P. and 
Hymowitz, R. ; J. Agri. Food ; Vol. 40; pp. 2356-2363; (1992); 
incorporated herein in its entirety by reference) and the 
reduction of an unknown 14 kDa protein. To determine 
whether the unknown 14 kDa protein is a methionine-rich seed 
storage protein, seed proteins from wild type seeds and BNP 
transgenic seeds are electrophoretically separated and 
electrotransf erred to PVDF membranes, and then the membranes 
are probed with iodo[ 14 C] acetic acid (ICN Radiochemicals, 
Irvine, CA) , pH 2.0, according to the method of de Lumen and 
K* 10 ( J. Agric. Food Chem. ; Vol. 35; p. 688; (1987); 
incorporated herein in its entirety by reference) . An 
autoradiogram of this gel blot shows that the 14 kDa protein 
is a methionine-rich protein. This protein may belong to a 
family of methionine-containing peptides previously observed 
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by Kho and de Lumen (Plant Food Hum, m,,^ . Vo l. 38; p 287- 
(1988); incorporated herein in its entirety by reference,' 
using the same technique. 

To purify this protein, mature dry seed of soybean 
5 (Glycine max, is ground into a fine meal, defatted by 
extraction with hexane (1:1 w /v, and vacuum dried. 100 g of 
defatted flour is homogenized in a Waring blender for 5 min 
at 4 o C with 400 ml 10% DMS 0/ 0.5% n-butanol, 100 mM KCL, 83 
mM sodium acetate buffer, pH 5.2, (albumin extraction 
10 buffer, . All foll owing steps are carried out either on ice 
or at 4° C. 

The slurry is tutored through Miracloth® (Calbiochem, 
LaJolla, CA) and centrifuged at 6000 Xg lor 15 min. The 
recovered supernatant is dialled (Spectra/por 7, hkco 3500, 
» Baxter, HcGaw Part, il, extensively against 0.5» n-butanol, 
100 m kcl, 83 ■» sodium acetate buffer, p„ 5.2 and 
concentrated in the diaXysis bags to about 100 ml with dry 
polyethyleneglycol (peg 8000,. Precipitated contaminating 
globulin proteins are removed by centrifugation at 6000 Xg 
» for 15 min. and by filtration through a 0.45 m membrane 
The resulting albumin extract contains approximately 20» of 
the total seed protein. 5-10% of the albumin fraction is 
represented by the U *Da polypeptides which comprises 
approx 1-2, of the total soybean seed protein ,0.5-1* of 
the seed weight, in wild-type seeds. The extractability in 
dilute acidie buf£er clMsifies ^ u ^ prote . ns ^ 

albumrns (Osborne, The Veoet.K,. ^ 
«ed.,, ^ndon I1924); incorporated herein ^ 

reference,. The 14 ». protein dissociates in SDS PAGE 

TZ T Cln : COndiUOnS lnt ° tW ° ^»*t"-, apparently 
of 1» M,a and of 5 *Da respectively, indicating linkage by 
disulfide bridges in the nonprotein. 

Five ml of the concentrated albumin extract (cone 
approx. 20 mg/ml,, is further fractionated using an superde* 
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75 HiLoad 26/60 column (Pharmacia, Uppsala, Sweden) which is 
run with the extraction buffer. The flow rate is maintained 
at 1 ml/min. and fractions of 4 ml are collected and 
analyzed by PAGE. The fractions containing the putative 
5 albumin (fraction 33-35, 18 mg protein), obtained with 
approximately 50% purity and Kunitz trypsin inhibitor (KTI) 
(Kollipara supra ) as the major contaminant, are adjusted to 
pH 8.5 with Tris HC1 (1 M) and chromatographed using a 100 
ml Sephadex G 25 sf column (Pharmacia, Uppsala, Sweden) with 

10 50 mM sodium acetate, pH 5.2, running buffer at 1 ml/min. 

The 14 kDa proteins exhibit, under these conditions, an 
unexpected interaction with the dextran matrix of the column 
and separate from its protein contaminants as a single peak 
with more than 95% purity. A similar specific interaction 

15 with the dextran matrix can be observed with the 2S albumin 
from Brazil Nut and can be used for its purification in a 
single step. Other albumins also behave in a similar 
manner . Other carbohydrate matrices known to the skilled 
artisan may similarly be used in the process. Though the 

20 above mentioned chromatography step has been specifically 
described, it can be replaced by other techniques involving 
specific interactions, e.g., but not limited to batch 
processes . 

The above-obtained putative albumin fraction is dialyzed 
25 (Spectra/por 7) for 15 hours against 20 mM Tris/HCl pH 8.5 
and concentrated in the dialysis bags to about 0.5 mg/ml 
protein with dry PEG 8000. 5 mg of the desalted protein is 
filtered through a 0.2 *ua membrane filter and fractionated 
further by ion-exchange chromatography using a MonoQ HR 5/5 
30 (Pharmacia, Uppsala, Sweden) column, developed in a gradient 
of 0-750 mM NaCl in 20 mM Tris/HCl, pH 8.5 buffer. Three 
separate peaks, elute at 180 mM NaCl (designated albumin 1), 
250 mM NaCl (designated albumin 2), and 360 mM NaCl 
(designated albumin 3) . Albumin 3 (A13) appears to be the 
35 major form, i.e. it contains more than 90% of the protein in 
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all three fractions combined, whereas albumin 1 (All) and 
albumin 2 (A12) are found to be each approximately 20 times 
less abundant among the soybean seed proteins when compared 
to Al 3. All three albumin fractions are obtained at near 
homogeneity based on SDS-PAGE. After treatment with the 
reducing agent 2-mercapto-ethanol, each of the three albumin 
forms dissociated in SDS PAGE into two smaller polypeptides 
of different length, indicating the presence of disulfide 
bonds in the native protein. The sizes of the larger 
peptides in each of the reduced albumins appear to be 
similar (10 kDa), whereas the shorter peptides appear to be 
of different sizes. The All smal i chain has an estimated 
molecular weight of 4.5 kDa, the A12 small chain of 4.8 kDa 
and the A13 small chain of 5.1 kDa, respectively. 

PVDF blots of the electrophoretically separated A12 
large chain, A12 small chain, A13 large chain and A13 short 
cham are subjected to amino acid analysis (Table 1} Both 
albumins contain the predicted high contents of methionine 
and xn addition, a surprisingly high percentage of lysine 
Although it appears that the amino acid compositions of the 
two albumins are generally similar, some clear differences 
with some amino acids are observed. 
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TABLE 1 
AMINO ACID COMPOSITION 

A12 A13 



5 




5 kDa 
Peptide 
Mole % 


11 kDa 
Peptide 
Mole % 


5 kDa 
Peptide 
Mole % 


11 kDa 
Peptide 
Mole % 




Cys 


1.93 


3.38 


2.79 


2.88 




Asx 


10.89 


8.17 


17.96 


9.47 


10 


Met 


3.13 


8.00 


2.35 


8.70 




Thr 


1.76 


1.03 


4.10 


3.02 




Ser 


9.62 


9.00 


7.05 


7.43 




Glx 


21.86 


19.39 


15.42 


21.80 




Pro 


0.00 


2.65 


3.67 


3.02 


15 


Gly 


14.01 


9.43 


5.85 


6.64 




Ala 


12.99 


10.72 


5.29 


11.10 




Val 


0.00 


0.00 


3.63 


0.42 




lie 




5. 90 


4.46 


4.07 




Leu 


5.33 


8.96 


6.84 


8.32 


20 


Tyr 


0.38 


0.64 


2.45 


0.00 




Phe 


0.76 


0.54 


1.90 


0.31 




His 


2.93 


1.11 


3.09 


1.24 




Lys 


4.24 


8.11 


6.43 


8.85 




Arg 


3.58 


2.96 


6.80 


1.85 
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The amino terminal sequence of all small and large 
chain peptides is determined from PVDF blots of the 
respective elect rophoretic peptide bands by automated Edman- 
degradation in an Applied Biosystems sequencer. The amino- 
terminal sequences of albumin 1 and 2 are identical. The 
amino-terminal sequences of albumin 3 are different than 
those of albumin 1 and 2. However, the amino-terminal 
sequences of albumin 3 have a high degree of homology to the 
amino-terminal sequences of albumin 1 and 2 (about 80%). 
These amino-terminal sequences are most closely related to 
sequences found in conglutin 6, a sulfur rich 2S protein 
from Lupinus angustif alius L (Gayler, et al.; Plant Mol. 
Biol^; vol. 15; p. 879; (1990); incorporated herein in its 
entirety by reference) . 

To elucidate the differences between the Al peptides, 
the albumin fractions are further analyzed by isoelectric 
focusing. The isoelectric point of All i s determined at a 
PH of 6.05, of A12 at a pH of 5.45 and of A13 at a pH of 
4.95, respectively. 
» Since the cdna specific for All and A13 encode a 

consensus sequence for asparagine linked N-glycosylation, 
(see below), concanavalin A binding to albumin-containing 
fractions is analyzed. None of the All peptides bind 
concanavalin A, nor are molecular weight size differences 
apparent in SDS page after N-Glycosidase F treatment 
Therefore, N-glycosylation of soybean albumins seems 
unlikely. 

The protein sequencing data, together with the amino 
acid composition results, indicates the occurrence of the 
following distinct yet undescribed methionine- and lysine- 
rich albumin gene products, i.e. All and A12, and A13, in 
soybean seeds. The similarity of the N-terminal All and A12 
amino acid sequences can be explained by the assumption of 
differential post-translational processing events of the 
35 same gene product. 
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EXAMPLE II. 

Isolation of Albumin-specific cDKA Clones 
from a Soybean Seed cDNA Library RNA isolation, 
cDNA synthesis and sequence analysis. 

DNA isolation, DNA manipulations, radiolabelling of DNA 
and hybridizations are done essentially as described by 
Sambrook, et al.; Molecular Cloning: A Laboratory Manual , 
Cold Spring Harbor (1989); incorporated herein in its 
entirety by reference. 

Soybean plants (Glycine max Merr.) are grown in the 
greenhouse or in the field. Developing, mid-maturation 
soybean seeds are harvested and stored frozen at -80°C to be 
used as a mRNA source for cDNA library construction. 

Total RNA is isolated from pooled developing soybean 
seed (1-15 mm in size) . Frozen seeds (1-2 g fresh weight) 
are ground to a powder in a pestle and mortar, and RNA is 
isolated according to methods described in Shure, et al.; 
Cell; Vol. 35; p. 225-233; (1983); incorporated herein in 
its entirety by reference. mRNA is isolated from 1 mg total 
RNA using an oligo-dT Sepharose spin, column according to the 
manufacture's instructions (Pharmacia, Uppsala, Sweden). 
Five ^g of purified mRNA is used as a template for cDNA 
synthesis and ligation into Stratagene Lambda Zap II vector 
arms according to the manufacturer's instructions 
(Stratagene, La Jolla, CA) . One hundred ng of size selected 
cDNA (>500 bp) is ligated to the vector arms and packaged 
(Stratagene Gigapack Gold) to yield a primary library of 1.2 
x 10 6 pfu with an average cDNA insert size of 1.2 kb. This 
library is amplified in E. coli Sure cells (Stratagene) to 
give a titre of 2 x 10 10 pfu/ml. 

Two hundred random plaques are isolated and re- 
suspended in 500 \il SM. Phagemids (Bluescript S/K) are 
excised from the Lambda ZAP II vector according to the 
protocol recommended by Stratagene using helper phage R408 
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and E. coli host strain XL1 Blue. Single colonies are grown 
overnight in 2 ml of 2 x YT medium containing 100 ug/ml 
ampicillin. The plasmid DNA is isolated by alkaline lysis 
and ethanol precipitation (Sambrook, et al.; Supra ; (1989); 
incorporated herein in its entirety by reference) . 

The 5' sequence from 200 individual cDNA clones is 
obtained using the T3 primer by Tag cycle sequencing on an 
ABI catalyst 8000 Molecular workstation and ABI 1373a 
sequenator (Applied Biosystems) . Sequence data is edited 
manually to remove vector sequence and a database of the DNA 
sequence information from the 200 randomly picked cDNA 
clones using this library is created to facilitate the 
identification and isolation of cDNA clones encoding 
abundant expressed polypeptide sequences for which the 
albumin polypeptides would be an example. 

Identification of Alhnmi n Specific cDNA. Clones 
The cDNA database is searched with back-translated DNA 
sequences corresponding to the amino terminal sequences of 
the All/2 and A13 small and large chains using the FASTA® 
algorithm (Genetics Computer Group, Wisconsin sequence 
analysis package, Version 8). a section of the deduced 
amino acid sequence of clone EST 3_38 is found to display an 
exact match to the amino terminal sequences derived from the 
small and large chains of All. The deduced amino acid 
sequences of clone EST 2_36, clone EST 3_13, clone EST 3_14 
and clone EST 3_62 are found to align exactly with the 
obtained amino terminal sequences of the A13 peptides. 
Furthermore, a computer comparison of the deduced amino acid 
sequence of clone EST 3_38 and of clone 3_62 with the 
Genbank sequence data base using the T FASTA® algorithm 
(Genetics Computer Group) reveal homology to conglutin 5, a 
sulfur-rich 23 protein from lupin seeds ILupinus 
angvstifolius L. ) . (Ga yl er/ e t al.; Supra ; (1990); 

incorporated herein in its entirety by reference) . 
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An about 600 bp EcoRI fragment from clone EST 3_38 and 
an about 400 bp EcoRI/SacI fragment from clone EST 362 are 
labeled with [ 32 P] dCTP (Amersham) using the Ready Prime kit 
from Amersham. The labeled fragments are used to screen 
5 15,000 recombinant phages from the cDNA library derived from 
developing soybean seeds in Lambda ZapII (Stratagene) . 
Approximately 3% of the clones in the library hybridize to 
both albumin probes. 

Forty-five albumin specific phages are randomly 
10 selected and the corresponding phagemids are subsequently 
excised according to the manufacturer's recommendations and 
sequenced. Among the sequenced clones, 42 are found to be 
albumin 3 specific (7 encoding the entire coding sequence) 
and 3 are found to be albumin 1 specific (one encoding the 
15 entire coding sequence) . 

The inserts of the longest identified All and A13 
specific clones, pAll_42 and pA13_49, respectively, are 
sequenced in their entirety {Figures 2 and 3) and 
consequently entered into the Pioneer plasmid collection 
20 under the names p9330 and p9331 respectively. Sequence 
analysis clearly identifies that these clones contain full- 
length coding sequences, encoding both the N-terminal signal 
peptides and the stop codon. 

Albumin 1 is encoded by 465 base pairs comprised in a 
25 723 base pair cDNA (SEQ ID NO: 1). This cDNA encodes a pre- 
propeptide having 155 amino acids (SEQ ID NO: 2) . The pre- 
propeptide comprises a 20 amino acid signal peptide, about a 
55 amino acid small chain, and about a 80 amino acid large 
chain. The mature albumin protein comprises two disulfide 
30 linked chain, a 4-5 kDa small chain and a 10 kDa large 
chain. The amino acid composition of the deduced amino 
acid. Sequence of albumin 1 includes 11.8 mol% methionine 
and cysteine residues, 9.6 mol% lysine residues and 12.6 
mol% asparagine and glutamine residues. 
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Albumin 3 is encoded by 474 base pairs comprised in a 
777 base pair cDNA (SEQ ID NO: 3) . This cDNA encodes a pre- 
propeptide having 158 amino acids (SEQ ID NO: 4). The pre- 
propeptide comprises a 21 amino acid signal peptide, about a 
60 amino acid small chain and a 77 amino acid large chain. 
The mature albumin 3 contains two disulfide linked chains. 
The deduced amino acid composition of albumin 3 includes 
11.6 mol% methionine and cysteine residues, 10.2 molS lysine 
residues, and 13.2 mol% asparagine and glutamine residues. 

EXAMPLE III. 

To further enhance for the preferred amino acid 
residues lysine and methionine and to further reduce non- 
preferred amino acid residues asparagine and glutamine, a 
cDNA encoding a chimeric albumin, termed albumin 1/3 (Al 
1/3) (SEQ ID NO: 6) , is prepared based on a GAP alignment 
(Genetics Computer Group) of the amino-acid sequences of All 
and A13 (Fig. 5) . 

The cDNA clone P 9331 (pA13_49) is modified by 
oligodexyribonucleotide-directed mutagenesis using the Muta- 
Gene Phagemid in vitro Mutagenesis kit from BioRad 
(Hercules, CA) based on the Kunkel method (Kunkel, T A 
Proc. Nat. Acad. Sci. USA , Vol. 82; p. 488; (1985); 
incorporated herein in its entirety by reference) according 
to the manufacturer's recommendations. Mutagenesis is 
carried out in five consecutive repetitions of in vitro 
mutagenesis with five oligodeoxyribonucleotide primers 
pnmers and the changes they confer to the cDNA sequence are 
summarized in Table 2. 
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The amino acid codons at the indicated positions of the 
cDNA encoding A13 are essentially only changed into codons 
which encode preferred amino acids found at the same 
relative positions (GAP alignement) in the protein sequence 
of All. Thus the resulting amino acid sequence Al 1/3 is 
termed a chimeric albumin. All changes of amino acid 
residues are made in sequence regions which are considered 
important for the protein structure of related 2S albumins 
from seeds of other plant species and are therefore not 
obviously amenable for a change. Nevertheless, because the 
amino acid residues in Al 1/3 are already present in either 
All or A13, the structure of the chimeric protein is 
unlxkely to exhibit any deleterious effects when expressed 
xn a seed. Albumin 1/3 has 158 amino acids (Fig. 6) The 
ammo acid composition of albumin 1/3 includes 12.4 mol% 
methionine and cysteine residues, 13.14 mol% i ysine 
residues, and 10.3 mol% asparagine and glutamine residues. 

EXAMPLE IV- 

with 



Soybean (Glycine max) seed, is surface sterilized by 
exposure to chlorine gas evolved in a glass bell jar. Gas 
xs produced by adding 3.5 ml hydrochloric acid (34-37% w/w) 
to 100 ml sodium hypochlorite (5.25% w/w, . Exposure is for 
16-20 hours in a container approximately one cubic f oot in 
volume. surface sterilized seed i s stored in petri dishes 
at room temperature. Seed is germinated by plating on l/io 
strength agar solidified medium according to Gamborg, e t 
aL; ( Exp. cell. r.^ . Vol . 50/ pp lfil _ 158; 

-incorporated herein in its entirety by reference,. ( B 5 
basal medium with minimal organics, Sigma Chemical Co Cat 
no. G5893; 0.32 g/ L; sucrose , fl>2% ^ ^ ' ^ 

morpholinojethanesulfonic acid ( MES) , 3.0mM, without plant 
growth regulators and culturing at 28'C with a 16 hour day 
length and cool white fluorescent illumination of 
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approximately 20 mEmV. After three or four days, seed is 
prepared for co-cultivation. The seed coat is removed and 
the elongating radical is removed 3-4mm below the 
cotyledons. Ten prepared seeds are held in each of several 
petri dishes. 

Construction of Plant Gan g Expression Casaatta* 

The expression cassette containing one copy of a 
soybean albumin gene under the control of phaseolin 
regulatory sequences is the binary plasmid p9127. P 9i 2 7 i s 
constructed in several steps beginning with 
oligodeoxynucleotide directed mutagenesis of P 9330 ( P A11_42) 
which contains the full-length cooling sequence of the All 
protein in the plasmid backbone of Bluescript SK 
(Stratagene®) . Mutagenesis is carried out as described in 
Example III with oligodeoxyribonucleotide: 

1) 5'GCACGAGTCATGACCAAGTCACAATTCTC 3' (SEQ ID NO: 12); and 

2) 5' TCCTCCGATGACTGAGTTAACAAAAAAAGTACTAC 3' (SEQ ID NO: 13); 
so that an Real site is placed and a Hindlli site is 
destroyed at the start of translation and an Hpal site is 
added just 3' of the stop codon. Upon digestion with the 
restriction endonucleases Rcal/Hpal, a 472 base-pair DNA 
sequence corresponding to the full length coding sequence of 
All is isolated and cloned into p4752 (Ncol/Hpal) . P 4752 
(Fig. 6) contains 883 base pairs of the phaseolin 5' 
regulatory sequences (i.e. promoter) followed by 84 base 
pairs of the phaseolin 5' untranslated region. Immediately 
3' to these sequences are an Ncol site and Hpal site to 
facilitate cloning in the 5' -+ 3' direction of an open 
reading frame resulting in the codon methionine start 
translation generated by the Ncol site (- CCATGG) becoming 
the translational start codon. Downstream of the Hpal site 
is 1230 base pairs of phaseolin 3' regulatory sequences. 
P4752 thus contains the phaseolin promoter: phaseolin 
terminator. 
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The resulting plasmid, p9069 is then digested with the 
restriction endonucleases EcoRI/Hindlll and the phaseolin 
promoter: All :phaseolin terminator portion is inserted into 
the EcoRI/Hindlll site of plasmid pl830 (=p A RC12, (Prosen, 
et al.; Biotechnology ; Vol. 5; p. 966; (1987); incorporated 
herein in its entirety by reference) . Plasmid P 1830 is a 
29.5 Jcb plasmid which is part of a binary vector system of 
Agrobacterium and contains the chimeric gene nopaline 
synthase/neomycine phosphotransferase II as a selectable 
marker for plant cells. 

The plasmid resulting after the insertion of the 2.89 
kb fragment of P 9069 inserted into p!830 is termed P 9127 
Plasmid p9127 is about 33 *b in si2e and confers resistance 
to tetracycline to the bacterial host. 

The plasmid is then transformed to Agrobacterium 
t»e/aciens strain LBA 4404 by the freeze/thaw method, 
known in the art. The presence of the binary plasmid in the 
resulting bacteria is confirmed by Southern blot analysis, 
grgear .tion of Affrobaetarlm, tnmefm ^ m lAtu ^ p 0 , 

<*erni,ht culture of Agrobacteriu* tumefaciens straln 
4404 harboring the binary plasmid p9127 grown to lo, 
Phase in Minimal A medium containing tetracyline, 1.0 mg/ml, 
is pooled and an optical density measurement at 550 nm is 
taken Sufficient volume of the culture is placed in 15 ml 
conical centrifuge tubes such that upon sedimentation 

whT'n I'l ^ " 10 '° Ce " S ~»«*" *» -<* tube, 
where O.D.550 of 1.0 - l 4 v m 9 -.n./ , „ 

cells/ml. Sedimentation is 
by centrifugation at 6000 g for 10 minutes> 

centrifugation the supernatant is decanted and the tubes are 
held at room temperature until inoculum is needed, but not 
longer than one hour. r 

Transformation 

,ut.T?" are conductad ln batches such that «* 

Plate of seed is treated with a newly resuspended pellet of 
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Agrobacterium. One at a time, the pellets are resuspended 
in 20 ml inoculation medium. Inoculation medium consist of 
B5 salts (Sigma Chemical Co.), 3.2 g/L; sucrose, 2.0% w/v 6- 
benzylaminopurine (BAP), 44 mM; indolebutyric acid (IBA), 
0.5 mM; acetosyringeone (AS), 100 mM and is buffered to pH 
5.5 with MES, 10 mM. Resuspension is by vortexing. The 
inoculum is then poured into a petri dish containing 
prepared seed and the cotyledonary nodes are macerated with 
a surgical blade. This is accomplished by dividing seed in 
half by longitudinal section through the shoot apex 
preserving the two whole cotyledons. The two halves of the 
shoot apex are then broken off their respective cotyledons 
by prying them away with a surgical blade. The cotyledonary 
node is then macerated with the surgical blade by repeated 
scoring along the axis of symmetry. Care is taken not to 
cut entirely through the explant to the adaxial side. 
Twenty explants are prepared in roughly 5 minutes and then 
incubated for 30 minutes at room temperature without 
agitation. Additional plates are prepared during this time. 
After 30 minutes the explants are transferred to plates of 
the same medium solidified with Gelrite (Merck & Co., Inc.), 
0.2% w/v. Explants are embedded with the adaxial side up 
and level with the surface of the medium and cultured at 
22°C for three days under cool white fluorescent light, 
approximately 20 mEm 2 S 1 . 

Culture and Selection 
After three days the explants are moved to liquid 
counterselection medium. Counterselection medium consists 
of B5 sales, 3.2 g/L; sucrose, 2.0% w/v; BAP, 5.0 mM; IBA 
0.5 mM; vancomycin, 200 mg/ml; cefotaxime, 500 mg/ml and is 
buffered to pH 5.7 with MES, 3 mM. Ten explants are washed 
in each petri dish with constant, slow gyratory agitation at 
room temperature for four days. Counterselection medium is 
replaced four times. 
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The explants are then picked to agarose solidified 
selection medium. Selection medium consists of B5 sales, 
3.2 g/L; sucrose, 2.0%, w/v; BAP, 5.0 mM; IBA, 0.5 m] 
kanamycin sulfate, 50 mg/ml and is buffered to pH 5.7 with 
MES, 3.0 mM. Selection medium is solidified with SeaKem 
agarose, 0.3% w/v. The explants are embedded in the medium, 
adaxial side down and cultured at 28°C with a 16 hour day 

length and cool white fluorescent illumination of 60-80 
mEm^S 1 . 

After two weeks explants are again washed with liquid 
medium on the gyratory shaker. This time the wash is 
conducted overnight in counterselection medium containing 
kanamycin sulfate, 50 mg/ml. The following day explants are 
picked to agarose solidified selection medium. Again they 
are embedded in the medium, adaxial side down; the culture 
is as before for another two week period. 

Regeneration 

After one month on selective media transformed tissue 
becomes visible as green sectors of regenerating tissue 
against a background of bleached, less healthy tissue. 
Explants without green sectors are discarded, explants with 
green sectors are transferred to elongation medium. 
Elongation medium consists of B5 salts, 3.2 g/L; sucrose, 
2.0% w/v; IBA, 3.3 mM; gibberellic acid, 1.7 mM; vancomycin,' 
100 mg/ml; cefotaxine, 30 mg/ml; and timentin, 30 mg/ml, 
buffered to P H 5.7 with MES, 3.0 mM. Elongation medium is 
solidified with gelrite, 0.2% w/v. They are embedded 
adaxial side up and cultured as before. Culture is 
continued on this medium with transfer to fresh plates every 
two weeks. When shoots become 0.5 cm in length they are 
excised at the base and placed in rooting medium in 13 x 100 
mm test tubes. Rooting medium consists of B5 salts, 3.2 
g/L; sucrose, 15 gm/L; nicotinic acid, 20 mM; pyroglutamic 
acid (PGA) , 900 mg/L and IBA, 10 mM. It is buffered to pH 
5.7 with MES, 3.0mM and solidified with Gelrite, 0.23 w/ v . 
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After ten days the shoots are transferred to the same medium 
without IRA or PGA, Shoots are rooted and held in these 
tubes under the same environmental conditions as before. 

When a root system is well established, the plantlet is 
transferred to sterile soil mix in plant cons (ICN 
Biomedicals, Inc., Irvin, CA, cat no. 26-720 & 1-02). 
Temperature, photoperiod and light intensity remain the same 
as before. Under these conditions the regenerates become 
vigorous, mostly normal (though small) plants. When their 
root systems again become well established, a corner of the 
plant cone is cut off and the plants are gradually hardened 
off in an environmental chamber or greenhouse. Finally they 
are potted in soil mix and grown to maturity, bearing seed, 
in a greenhouse. 

Growth, Inc rease, and Harvest of Transgenic Systems 

Seed from untrans formed and transformed plants of the 
same variety is planted in the spring and harvested in the 
fall. Each individual line is kept separate while grown in 
one or more 10.5 foot rows for maximum increase. 

The determination of the levels of a particular protein 
can be determined by methods well known in the art 
including, but not limited to enzyme linked immunoassays, 
immunofluorescent assays, Western blot analysis and 
immunoprecipitation analyses. 

The amino acid content of seeds from transformed and 
untransformed plants is analyzed by methods described in the 
Office Methods of Analysis of the AQAC , Hilrich (ed.), AOAc 
International; Vol. 2; p. 1096; (1990); incorporated in its 
entirety by reference. 

EXAMPLE IV 

Preparation of Albumin-specific Antibodies 

Antibodies specific for albumin polypeptides are 
produced by injecting female New Zealand white rabbits 
(Bethyl Laboratory, Montgomery, TX) six times with 
homogenized polyacrylamide gel slices containing 100 |ig of 
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PAGE purified albumin. Animals are then bled at two week 
intervals. The antibodies are further purified by affinity- 
chromatography with Affigel 15 (BioRad) -immobilized antigen 
as described by Harlow, et al.; Antibodies: A Laboratory 
Manual, Cold Spring Harbor, NY; (1988); incorporated herein 
in its entirety by reference. The affinity column is 
prepared with purified albumin 3 essentially is recommended 
by BioRad®. Immune detection of antigens on PVDF blots is 
carried out following the protocol of Meyer, et al.; j. 
Cell. Biol. ; Vol. 107; p. 163; (1988); incorporated herein" 
in its entirety by reference, using the ECL kit from 
Amersham (Arlington Heights, IL) . 

All publications and patents are incorporated by 
reference herein, as though individually incorporated by 
reference. The invention is not limited to the exact 
details shown and described, for it should be understood 
that many variations and modifications may be made while 
remaining within the spirit and scope of the invention 
defined by the claims. 
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SEQUENCE LISTING 

il) GENERAL INFORMATION: 

;i) APPLICANT: 

<AJ ADDRESSEE: PIONEER HI-BRED INTERNATIONAL, INC. 

:B) STREET: Darwin 31dg., 7100 N.W. 62nd Ave. 

(CI CITY: Johnston 

(D) STATE: Iowa 

:E: COUNTRY; USA 

(F: 21?; 50131-1000 

Ui.J TITLE OF INVENTION: ALTERATION OF AMINO ACID ^MPOS^ T ^ ^* 
SEEDS ~ 

(iii) NUMBER OF SEQUENCES: 13 

(iv; COMPUTER READABLE FORM: 

(Ai MEDIUM TYPE: Floopy disk 

{3) COMPUTER: IBM PC comoatible 

(CI OPERATING SYSTEM: PC- DOS /MS - DOS 

iD; SOFTWARE: Patentln Release #1.0, Version #1.30 

:v: CURRENT APPLICATION DATA: 

:A) APPLICATION NUMBER: PCT: Ur.assignsd 
(3) FILING DATE: Cor.currer._Iv herewith 

i vi ? PRIOR APPLICATION DATA: 

!A* APPLICATION NUMBER: 03/513,911 
(3! FILING DATE: 20-MAR-1996 



(2) INFORMATION FOR SEQ ID NO: I: 

*.L> SEQUENCE CHARACTERISTICS : 

iA5 LENGTH: 723 base oairs 
(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY; CDS 

(B) LOCATION: 10.. 474 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GCACGAGAA ATG ACC AAG CTT ACA ATT CTC CTC ATC GCT CTT CTC TTC 
Met Thr Lys Leu Thr lie Leu Leu lie Ala Leu Leu Phe 
15 10 



4a 



ATC GCC CAC ACC TGC TGC GCC TCC AAA TGG CAA CAG CAC CAG CAA GAG 96 
lie Ala His Thr Cys Cys Ala Ser Lys Trp Gin Gin His Gin Gin Glu 
15 20 25 

AGC TGC CGC GAG CAG CTC AAG GGG ATC AAC CTC AAC CCC TGT GAG CAC 144 
Ser Cys Arg Glu Gin Leu Lys Gly lie Asn Leu Asn Pro Cvs Glu His 
30 35 40 " 45 

ATC ATG GAG AAG ATC CAA GCT GGC CGC CGC GGC GAG GAC GGC AGC GAC 192 
He Met Glu Lys He Gin Ala Gly Arg Arg Gly Glu Asp Glv Ser Asp 
50 55 60 
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GAA GAT CAC ATT CTC ATC AGG ACC ATG CC5 GGA AGA ATC AAC TAC A -r 
Asp His lie Leu lie Arg Thr Met Pro Gly Arg lie Asn Tvr lie" 
65 7 0 75 ' 

AGG AAG AAG GAA GGA AAA GAA GAA GAA GAA GAA GGA CAC ATG CAG AAG 

" y3 "X* ^ V LyS G1U Giu Glu Giu Glu G1 V His Met Gin Lvs 

80 85 go 

TGC TGC AGC GAA ATG AGC GAG CTG AAA AGC CCC ATA TGC CAG rr. C AAA 
.75 ser o.u Met Ser Glu Leu Lys Ser Pro lie Cys Gir. Cys Lvs 

1 105 

GC3 CTA CAG AAG ATA ATG GAT AAC CAG AGC GAG CAA CTG GAG SGG *AG 
A,a «u Gin Lys He Met Asp Asn Gin Ser Giu Gir. Leu Giu £y Lv* 
iiJ 115 120 i 2 l 

GAG AAG AAG CAG ATG GAG AGA GAG CTC ATG AP<~ T^- nr T A""" Ar- -r- 
Giu Lys Lys Gin Met Giu Arg Giu Leu Met Asn Leu Ma lie kr* Cvs 

135 14 5 

AGG 7TG GGA CCC ATG ATA GGG TGC GAC TTG ICC TCC GA" GA r 
Arg Leu G.y Pro Met lie Gly Cys Asp Leu Ser Ser Aso Asc 
- 45 150 * ^c= 

-L — V 

TGAAAAAAAA GTACTACTAA CACATATATG TGTTAGTTTA TGCTAGCTAG AAGAAC3-»- 
AAGCTATCTC CGTATGTTGT ATATTAATAA AAAGATCA7C ACTGGTGAAT 3GTGATC3TG 
TATGTAACGT AGTGGGCAAT GGAAGCACTT AGAGTGTGCT TTGTGGCCTT GCCCTC^G™ 
TTGATAACTG AGACTTTTGC GAATACCGTT CGTTTTTCCC TTCAAAAAAA AA*JVAAAAAA 
AAAAAAAAA 



240 



288 



4-4 



t2; INFORMATION FOR SEQ ID NO:2; 

ii) SEQUENCE CHARACTERISTICS : 

iAJ LENGTH: 155 amino acids 
(B> TYPE: amino acid 
(D) TOPOLOGY: linear 

:ii) MOLECULE TYPE: protein 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Lys Leu Thr lie Leu Leu lie Ala Leu Leu Phe lie Ala His 

* 10 15 

Thr Cys Cys Ala Ser Lys Trp Gin Gin His Gin Gin Glu Ser Cys Arg 

25 30 

Glu Gin Leu Lys Gly lie Asn Leu Asn Pro Cys Glu His lie Mer G 'u 
J3 40 45 

Lys lie Gin Ala Gly Arg Arg Giy Glu Asp Gly Ser Asd Glu Asp His 

53 60 

lie Leu lie Arg Thr Met Pro Giy Arg lie Asn Tyr He Arg Lys Lys 

75 80 
Glu Gly Lys Giu Giu Giu Giu Giu Gly His Met Gin Lys Cys Cys S r 



90 95 
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Glu Mec Ser Glu Leu Lys S r Pro lie Cys Gin Cys Lvs Ala Leu Gin 
10u 105 110 

Lys lie Met Asp Asn Gin Ser Glu Gin Leu Glu Glv Lys Glu Lys Lvs 
115 120 X25 

Sir. Mec Glu Arg Giu Leu M z Asn Leu Aia He Arg Cys Arg Leu Glv 
A - c 135 140 

Pro Met lie Gly Cys Asp Leu Ser Ser Asp Asp 
"5 150 15 ? 

i2) INFORMATION FOR SEQ ID NO:3: 

(ii SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
iD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



Ux' FEATURE: 

(A; NAME /KEY: CCS 

(B) LOCATION: 28.. 501 



99 



14* 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAGCTCGTGC CGAATCGGCA CGAGAAA ATG ACC AAG TTC ACA ATC CTC CTC 

Met Thr Lys Phe Thr He Leu Leu 
160 

ATC TCT CTT CTC TTC TGC ATC GCC CAC ACT TGC AGC GCC TCC AAA r GG 
iie Ser Leu Leu Phe Cys He Ala His Thr Cys Ser Aia Ser Lys T~c 
lo5 170 !75 

CAG CAC CAG CAA GAT AGC TGC CGC AAG CAG CTC CAG GGG G T G «VAC ^ 
Gin His Gin Gin Asp Ser Cys Arg Lys Gin Leu Gin Glv Vai Asn Le*J 
180 185 190 " 195 

ACS CCC TGC GAG AAG CAC ATC ATG GAG AAG ATC CAA GGC CGC GGC GAT ' « 

Thr Pro Cys Glu Lys His He Met Glu Lys He Gin Gly Arg Glv Asp ~ " 

200 205 210 

GAC GAT GAT GAT GAT GAC GAC GAC AAT CAC ATT CTC AGG ACC ATG CGG 243 
Asp Asp Asp Asp Asp Asp Asp Asp Asn His He Leu Arg Thr Met Ara 
215 220 225 

GGA AGA ATC AAC TAC ATA AGG AGG AAC GAA GGA AAA GAC GAA GAC GAA 29« 

Gly Arg He Asn Tyr lie Arg Arg Asn Glu Gly Lys Asp Glu Asd Glu 
230 235 240 

GAA GAA GAA GGA CAC ATG CAG AAG TGC TGC ACA GAA ATG AGC GAG C^G 339 
Glu Glu Giu Gly His Met Gin Lys Cys Cys Thr Glu Met Ser Glu Leu 
245 250 255 

AGA AGC CCC AAA TGC CAG TGC AAA GCG CTG CAG AAG ATA ATG GAG AAC 337 
Arg Ser Pro Lys Cys Gin Cys Lys Ala Leu Gin Lys He Met Glu Asn 
2^0 265 270 2 75 

CAG AGC GAG GAA CTG GAG GAG AAG CAG AAG AAG AAA ATG GAG AAG GAG 435 
Gin Ser Glu Giu Leu Giu Glu Lys Gin Lys Lys Lys Met Glu Lys Glu 
280 295 290 
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2 £ £S S S S £? 5S £ S S? S £ S S2 K - 

300 3C5 
2 III S So IAMM6 ™ AAAGCAATGT TGTCACTTGT 

310 

ACGTACTAAC ACATGATGTG ATAGTTTATG CTAGCTAGCT ATAACATAAG CTGTC T GTGA 
GTGTGTTGTA TATTAATAAA GATCATCACT GGTGAATGGT GATCGTGTAC GTACCCTAC"* 
TAGTAGGCAA TGGAAGCACT TAGAGTGTGC TTTGTGCATG GCCTTGCCTC TCTGAGA 
CTTTTGTAAT GTTTTCGAGT TTAAATCTTT GCCTTTGCGG AAAAAAAAAA AAAAAAAAA? 



AAAAAA 



(2) INFORMATION FOR SEQ ID NO: 4: 

US SEQUENCE CHARACTERISTICS • 

«A! LENGTH: 153 amino acids 
<B> TYPE: amino acid 
OS TOPOLOGY: linear 

iiii- MCLECULE TYPE: protein 

(Xij SEQUENCE DESCRIPTION: SEQ ID NO: 4; 
Met Thr Lys Phe Thr lie Leu Leu lie S er Leu Leu Phe Cys lie 

10 15 
His Thr Cys Ser Ala Ser Lys Trp Gin His Gin Gin Asp S er Cys Ar g 

^ys Gin Leu Gin Gly Val Asn Leu Thr Pro Cys Glu Lys 2 „. Met 

Glu Lys lie Gin Gly Arg Gly Asp Asp Asp Asp Asp Asp Asp Asp Aso 

" 60 
Asn His lie Leu Arg Thr Met Arg Gly Arg He Asn Tyr lie Arg Arg 

75 80 

Asn =!» «„ ly . Asp Gln Asp «„ «u Gl» «„ „„ „. Met Cl„ Lys 

c ys », aj net s« Giu ^ ^ Ser Pco tys Cys ^ ^ 

Ala Leu Gin Lys lie Met Glu Asn Gin Ser Glu Glu Leu Glu Glu Lys 

Oln Lys Lys Lys Met Glu Ly f Glu Leu lie Asn Leu J. Thr Met Cys 

1J;) 140 y 

Arg Phe Gly Pro Met lie Gin Cys Asp Leu Ser Ser Asp Asp 

150 155 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 777 base pairs 



531 

591 
65; 

7il 
77; 

777 
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(B) TYPE: nucleic acid 
Id STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



ACGTACTAAC ACATGATGTG ATAGTTTATG CTAGCTAGCT ATAACATAAG CTGTCTCTGA 
G7GTGTTGTA TATTAATAAA GATCATCACT GGTGAATGGT GATCGTGTAC G7ACCCTACT 
TAGTAGGCAA TGGAAGCACT TAGAGTGTGC TTTGTGCATG GCCTTGCCTC TGTTTTGAGA 
CTTTTGTAAT GTTTTCGAGT TTAAATC7TT GCCT7TGCGG AAAAAAAAAA AAAAAAAAAA 



99 



243 



vix; FEATURE: 

(A) NAtHE/KEY: CDS 
(BJ LOCATION: 28.. 501 

(Xi 4 SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GAGCTCGTGC CGAATCGGCA CGAGAAA ATG ACC AAG TTC ACA ATC CTC CTC 

Met Thr Lys Phe Thr lie Leu Leu 
160 165 

ATC TCT CTT CTC TTC TGC ATC GCC CAC ACT TGC AGC GCC TCC AAA TGG 
He Ser Leu Leu Phe Cys lie Ala His Thr Cys Ser Ala Ser Ly£ £ 
10 175 180 

G^n -In ™ f T f ° I GC CGC 06 C - AAG GGG GTG AAC CTC 

Gxn H.s «ln Gin Asp Ser Cys Arg Lys Gin Leu Lvs Giy Vai Asn Leu 

" 5 190 195 

ACG CCC TGC GAG AAG CAC ATC ATG GAG AAG ATC CAA GGC CGC GGC GA~* 
Thr Pro Cys G,u Lys His lie Met Glu Lys lie Gin Gly Arg slv Asc 
*w 205 210 

GAC GAT GAT GAT GAT GAC GAC GAC AAT CAC ATT CTC AGG AC V", C "~ 
Asp Asp Asp Asp Asp Asp Asp Asp Asn His He t£ J£J & 
2i5 220 225 230 

GGA AGA ATC AAC TAC ATA CGT AAG AAG GAA GGA AAA GAC GAA GAC GAA 
-ly Arg lie Asn Tyz He Arg Lys Lys Glu Gly Lys Asd Giu Asp Glu 
235 240 * 245 

GAA GAA GAA GGA CAG ATG CAG AAG TGC TGC ACA GAA ATG AGC GAG CTT 
Glu Glu Giu Gly Gin Met Gin Lys Cys Cys Thr Glu Met Ser Glu Leu 
250 255 260 

AAG AGC CCC AAA TGC CAG TGC AAA GCG CTG CAG AAG ATA ATG GAG AAC 
Lys Ser Pro Lys Cys Gin Cys Lys Ala Leu Gin Lys He Met Glu Asn 
265 270 275 

CAG AGC GAG GAA CTG GAG GAG AAG GAG AAC AAG AAA ATG GAG AAG GAG 

™Z lu Glu Leu Glu Glu L * s Glu *sn Lys Met Glu Lvs Glu 
280 285 290 

CTT ATG AAC TTG GCT ACT ATG TGC AGG TTT GGG CCC ATG ATC GGA TGC 
Leu Met Asn Leu Ala Thr Met Cys Arg Phe Gly Pro Met He Glv Cys 
295 300 305 " 3i 0 

GAC TTG TCC TCC GAT GAC TAAGAAGTTA AAAGCAATGT TGTCACTTGT «i 
Asp Leu Ser Ser Asd Asp 
315 



339 



33- 



435 



4S3 



591 
651 
711 
771 
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[2) INFORMATION FOR SEQ ID NO:6: 



ii) SEQUENCE CHARACTERISTICS: 

i A) LENGTH: 153 amino acids 
(3; TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met Thr Lys Phe Thr He Leu Leu He Ser Leu Leu Phe Cys lie VLa 
1 5 ic 15 * 

His Thr Cys Ser Ala Ser Lys Trp Gin His Gin Gin Asd Ser r ys a™ 

25 30 

Lys Gin Leu Lys Gly Val Asn Leu Thr Pro Cys Glu Lys His lip Me- 
35 40 45 

Glu Lys He Gin Gly Arc, Gly Asp Asp Asp Asp Asd Asd Asp Asd Asd 

55 60 

Asn His lie Leu Arg Thr Met Arg Gly Arg He Asn Tyr lie Arg Lvs 
6 ° 70 75 * go 

Lys Glu Gly Lys Asp Glu Asp Glu Glu Glu Glu Gly Gin Mec Gin Lys 
85 90 95 

Cys Cys Thr Glu Met Ser Glu Leu Lys Ser Pro Lys Cys Gin Cys Lys 
100 105 ll0 * y 

Ala Leu Gin Lys He Met Glu Asn Gin Ser Glu Glu Leu Glu Glu Lys 
115 120 !25 

Glu Asn Lys Lys Met Glu Lys Glu Leu Met Asn Leu Ala Thr Met Cys 

Arg Phe Gly Pro Met lie Gly Cys Asp Leu Ser Ser Asp Asp 

150 155 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 32 base pairs 
fB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCTGCCGCAA GCAGCTTAAG GGGGTGAACC TC 
(2) INFORMATION FOR SEQ ID NO: 8: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
(CI STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(iiJ MOLECULE TYPE: other nucleic a 
(A- DESCRIPTION: /desc = "Pri 



(xil SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGAAGAATCA ACTACATACG TAAGAAGGAA GGAAAAGACG 
(2) INFORMATION FOR SEQ ID NO: 9: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: sinale 

(D) TOPOLOGY: linear 

(iii MOLECULE TYPE: other nucleic a-:- 
(A) DESCRIPTION: /desc = "Primer" 



(xii SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GCTGCACAGA AATGAGCGAG CTTAAGAGCC CCAAATGCCA GTGC 

(2! INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 
(Ai LENGTH: 48 base pairs 
(B) TYPE: nucleic acid 
(C! STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(Ai DESCRIPTION: /desc = "Primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGAGGAGAAG GAGAAGAAGA AAATGGAGAA GGAGTTCATG AACTTGGC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(iii MOLECULE TYPE: ether nucleic ac*d 
(A) DESCRIPTION: /desc = "Primer" 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 
GCAGGTTTGG GCCCATGATC GGGTGCGACT TGTCCTC 
(2! INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

lii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "Primer" 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GCACGAGTCA TGACCAAGTC ACAATTCTC 
(2) INFORMATION FOR SEQ ID NO: 13: 

U: SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
(Ci STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc ~ "Primer" 



(xii SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TCCTCCGATG ACTGAGTTAA CAAAAAAAGT ACTAC 



51 



WO 97/35023 



PCT/US97/04409 



WHAT IS CLAIMED IS: 

1. 

An isolated and purified DNA molecule comprising a 
preselected DNA segment encoding a seed storage protein. 

The DNA molecule of Claim 1 wherein the seed storage 
protein is a soybean seed storage protein. 

3. 

The DNA molecule of Claim 1 wherein the soybean seed 
storage protein is an albumin. 

4. 

The DNA molecule of Claim 1 wherein the preselected DNA 
segment encodes a protein having SEQ ID NO: 2. 

5. 

The DNA molecule of Claim 1 wherein the preselected DNA 
segment has at least about (60%) identity with SEQ ID NO: 1, 
between base pairs 10 through 474. 

6. 

The DNA molecule of Claim 1 wherein the preselected DNA 
segment is SEQ ID NO: 1, modified by site-directed 
mutagenesis, such that the nutritional quality of the 
protein encoded is enhanced. 

7. 

The DNA molecule of Claim 1 wherein the preselected DNA 
segment encodes a protein having SEQ ID NO: 4. 

8. 

The DNA molecule of Claim 1 wherein the preselected DNA 
segment has at least about 60% identity with SEQ ID NO: 3, 
between base pairs 28 through 501. 

9. 

The DNA molecule of Claim 1 wherein the preselected DNA 
segment is SEQ ID NO: 3, modified by site-directed 
mutagenesis, such that the nutritional quality of the 
protein encoded is enhanced. 

52 



WO 97*35023 



PCT/US97/04409 



10. 

The DNA molecule of Claim 1 wherein the preselected DNA 
segment encodes a protein having SEQ ID NO: 5. 

11. 

5 An expression cassette comprising a preselected DNA 

segment encoding a soybean seed storage protein, operably 
linked to a promoter functional in a host plant cell. 

12. 

The expression cassette of Claim 11 wherein the 
10 promoter is a seed- specif ic promoter. 

13. 

A method of increasing the level of preselected amino 
acid in the seed of a plant, comprising: 

a) introducing into the cells of the plant 
15 an expression cassette comprising a 

preselected DNA segment encoding a 
soybean seed storage protein comprising 
at least one preselected amino acid, 
operably linked to a promoter functional 
in the cells of the plant to yield 
transformed plant cells; 

b) regenerating a transformed plant from 
the transformed cells; and 

c) isolating seeds from the regenerated 
transformed plant in which the seeds 
comprise the seed storage protein in an 
amount sufficient to increase the amount 
of the preselected amino acid in seeds 
of the transformed plant relative to the 
amount of the preselected amino acid in 
seeds of a corresponding untransformed 
plant. 

14. 

The method according to Claim 13 wherein the plant is 
soybean . 
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15. 

The method according to Claim 14 wherein the 
preselected amino acid is lysine. 

16. 

The method according to Claim 15 wherein the 
preselected amino acid is methionine or cysteine, in 
addition to lysine. 

17. 

The method according to Claim 16 wherein the amount of 
lysine in the seed is increased by at least about 5-10%. 

18. 

The method according to Claim 17 wherein the amount of 
methionine and cysteine int he seed is increased by at least 
about 15-30%. 

19. 

A seed produced by the method of Claim 13. 

20. 

A plant produced from the seed of Claim 19. 

21. 

A fertile transgenic plant containing an isolated 
preselected DNA segment comprising a promoter and encoding a 
soybean seed storage protein, which comprises at least one 
preselected amino acid selected from the group consisting of 
methionine, cysteine, and lysine, under the control of the 
promoter, wherein the DNA segment is expressed as the seed 
storage protein so that the level of a seed storage protein 
amino acid in the seeds of the transgenic plant is increased 
above the level in the seeds of a soybean plant which only 
differ from the seeds of the transgenic plant in that the 
DNA segment is not artificially introduced, and wherein the 
DNA segment is transmitted through a complete normal sexual 
cycle of the transgenic plant to the next generation. 

22. 

An antibody capable of specifically binding soybean 
albumin. 
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23. 



The antibody of Claim 22 which is capable of 
specifically binding a protein having SEQ ID NO: 2 or SEQ ID 
NO: 4. 



24. 

A 



protein encoded by a preselected DNA segment 
contained in an isolated and purified DNA molecule; 
according to Claim 4. 



25. 



The protein of claim 24 characterized in that the 
protein has SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: 5. 

26. 

A method for isolating and purifying 2S albumins 
comprising the step of separating the albumins from 
contaminating proteins by specifically interacting the 
albumins with the matrix of a carbohydrate resin. 

27. 

The method of Claim 26 wherein the carbohydrate is a 
dextran. 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



DEC 1 8 2003 

OflCIAL 



' Appl.No.: 09/478,567 Confirmation No.: 1859 

Applicant(s): Rao et al. „ : 

Filed: January 6, 2000 

Ait Unit: 1638 

Examiner: R. Kallis 

Title- COMPOSITIONS AND METHODS FOR ALTERING 
AMINO ACID CONTENT FOR PROTEINS 



Docket No.: 5718-168(35 718/193735) 
Customer No.: 29122 

August 14, 2003 

Comm i ssioner for Patents 
P.O.Box 1450 
Alexandria, VA 223 1 3-1 450 

AMENDMENT 
37 C.F.R. § 1.121 

Sir: 

In response to the Office Action dated June 17, 2003, please amend the above-identified 
application as follows: 

Amendments to the Specification begin on page 2 of this paper. 

Amendments to the Claims are reflected in the listing of claims beginning on page 5 of this 
paper. 

Rcmarks/Argnineuts begin on page 8 of this paper. 
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Amendment ft** Specification: 

Li the specification: 

Please revise the first full paragraph beginning on page 3, line 2, as follows: 

Fig. Figures 1 A and IB shows show VSP- homologies te&SUUia B**** storage protein 

( VSP) and o ther proteins, as follows. 

Please revise the first full paragraph beginning on page 3, line 3, as follows: 
VSP-b (same as VSPfQ and VSP-a (same as VSPa): Staswick, P.E., (1988), Plant 

Physiol. 87, 250-254. Th» ^i™ acid seq u e nce of the V SP-b protein is set" forth in SEQ JB 

yrv . th. am i»n arid se quence of the jgSJEjUH^^ 

Please revise the first full paragraph beginning on page 3, line 6, as follows: 

T.phos (tomato acid phosphatase): Erion, J.L., Ballo, B., May, L., Bussell, J.. Fox, T.W., 

& Thomas, S.R., SwissProt database accession number P27061. The ainjno acid sequ ^agepf 

this protein is fnrih in SEP m MQ:3. 

Please revise the first full paragraph beginning on page 3, line 9, as follows: 

Ph.vulg (Phaseolus vulgaris): Zhou. P-Y., Tanaka, T.. Yamauchi, D., & Minamikawa T. 

(1997), Plant Physiol. 1 13, 47°"4«< ^ «^ sequence of this iwplsfo i« set forflt in SEQ 

IB N0:4 

Please revise the first foil paragraph beginning on page 3, line 12, as follows: 
Ar.VSP (Arabidopsis thaliana): Yu, D.Y., Quiglcy, F., & Macho, R., EMBL database 
accession number X79490. The amino acid se quence o f tins pro tein j« «< forth in SEQ . F D 

NO:5. 

, Please revise the first: full paragraph beginning on page 3, line 15, as follows: 
Ar 1A-1, Arl 7A-l (Arabidopsis thaliana, floral organs): Utsugi, S., Sakamoto, Ogura, Y. 
Murata, M , & Motoyoshi, F. (1996) Plant Mol. Biol. 32, 7S9-76S. T|,e an.i P o acid peqn enceM 
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iw^ri^l" protein is set f orth in SE P ID NO :6, and th e amino a cjcLsefl ucnce of tlie J!AEl2A= 
l" protei n is set forth in SEP IDNQlL 

Please revise the first full paragraph beginning on page 3, line 18, as follows: 

Fig. 2 shows proposed VSP0 raethionme-enriched variants. The amuno acid seq ugnceof 

thA^pj^io^jaoMato amino acid of t1ie 

M ft *?.ft» protein is set forth in SF f O * P WM. the amino acid salience of tlie "VgPft -MeQE 

pro tein is set fo rth. inSlgQJDNP: 10. 

Please revise die first full paragraph beginning on page 3, line 23, as follows: 
Fig- 4 shows the VSP/3-metlO nucleotide.sequcnce. The VSPff -met 10 imcleotide 
** T ™,*.. j H nlm set fort h in SE P ID NO :l 1. 

Please revise the Abstract, beginning on page 27, line 5 , as follows: 
Methods and compositions for altering amino acid composition of a protein of interest are 
provided, particularly proteins whose three-dimensional structure is unknown. 'Hie method 
comprises creating interacting molecules to the native protein and selecting for engineered 
proteins which retain the native conformation by antibody binding. In this manner, the levels of 
essential amino acids in a protein can be increased yel; the biological activity of the protein 
maintained. ^ ^vwijWi is an exemplary nlant nm tein-Gfrefr* max ve^atiye stores 
Eggfjn £Vgp)- -in which tneth iomnejevgjshaye been increased. 

Please, revise the first full paragraph beginning on page 8, line 14, as follows: 
The transcriptional cassette will include the in S B-S B direction of transcription, a 
transcriptional and translation^ initiation region, a DNA sequence of interest, and a 
transcriptional and translational termination region functional in plants. The termination region 
may be native with the transcriptional initiation region, may be native with the DNA sequence of 
interest, or may be derived from another source. Convenient termination regions are available 
from the Ti-plasmid of A. tumefaciens. such as die octopine synthase and nopaline synthase 
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tei tninsition regions. See also, Guerineau et a!., (1991) Mol. Gen. Genet. 262:141-144; Proudfoot 
(1991) Cell 64:671-674; Sanfacon et ol. (1991) Genes Dev. 5:141-149; Mogen et al. (1990) Plant 
Cell 2:1261-1272; Munroe et al. (1 990) Gene 91:15 1-158; Ballas et al. 1 989) Nucleic Acids Res. 
17:7891-7903; Joshi et al. (1987) Nucleic Acid Res. 15:9627-9639. 

Please revise the first full paragraph beginning on page 18, line 4, to read as follows: 

(a) Conserved rcsidu es (higblighteeHii-blue shown in Fig. 1 ) were defined as 
those residues occurring in more than 5 of the 7 homologs. These were not targeted for 
substitution. The exceptions were: at residue numbers 19, 37, 146 and 179 (one of the homologs 
contained a methionine residue); at positions 67, 80, 130 and 169 (conserved hydrophobic amino 
acid exchanges observed in at least one sequence) and at position 50 (non-conservative changes 
from Asn to Ser/Cys in two sequences). 

Please revise the first full paragraph beginning on page 21, line 6, as follows: 

Fifty E. coli colonies containing randomly mutated VSP/J genes were picked as small 
patches to an SB agar plate containing glucose and ampicillin. Patches were allowed to grow 
overnight at 3-7-E1G 37°C and were then transferred to a nitrocellulose filter. On the surface of an 
SB agar- plate containing ampicillin and IPTG, this filter was placed on top (cell-side up) of a 
separate blocked filter to which the antigen (e.g., VSPa) had been coated. During an overnight 
incubation at 30QG 30°C, the cells expressed me VSP/3 variant they encoded. These proteins 
were able to diffuse through the top filter and, if correctly folded, bind the antigen-coated filter 
below. The next day, the antigen-coated filter was washed with PBS-0.05% [tween] Tween^ 
and incubated with HRP/anti-e tag conjugate. Since the VSP/3 mutants are cloned into the 
pCANTAB-SE vector which fuses a C-terminal epitope tog (e-tag) to the VSP/3 protein variants, 
bound proteins were detected by this antibody in combination with enhanced chemihmiinescence 
detection. 



RTA01/2I-I0598VI 



PACE 6/22 • RCVO AT 12/18(2003 11:S3:19 AM [Eastern Standard Time) • SVR:USPTO^ FXRF-1/0 • DNIS:8729306 • CSID: 9198622260 • DURATION <mm-$s):18-14 



DEC. - 1 8' 03 (THU) 11=48 



TEL = 91 98622260 



P. 007 



Appl.No.: 09/478,567 
Filed: January 6, 2000 
Page5 

Ainj^ 

1 . (currently amended) A Nueleie nucleic acid molecule encoding ^mprisingji 
mirjetttide se quence which encodes a n engineered protein having cmnimsingjni amino acid 
^iiftn^fl whinh differs from the amino acid sequen ce of a nati ve protein b\L^^ 
f^nnacj^ said en g ineered protein has a nalterecl amino acid composition in 
fiftinpanson to said native protein , wherein said altered amino acid composition c^jaisegioi 
increa se in essential amino a cid content to at least alxmt_^ 

amuiG-aeid^^ and wherein said engineered protein i^msthe 

conformation o f the native pmtei n and therefore binds to an aUeasto^ 
m tihody, mono clonal antibody, antibody ffo CTOTL^ 

of binding wkh-a to said correspond in£ native protein,, wherein said native pfflteinj^ 

2. (currently amended.) The nueleetide-seqiienee nucleic acid molecule of clai m 1 , 
wherein said altered amino acid changes composition coinpris(^,anJncrease mjhe-levels-e&at 
least-one essential amino acid content to at lea st about 10% h vthe-i>t--0tgm. 

:L (currently amended) The ftuoleotidMequeaee nucleic acid mo lecule encoding of 
claim 3 1 wherein said altere d amino ad djgomi^ 

essential amino acid m selected from the group consisting of methionine, tryptophan, lysine? 
4* (canceled) 

5. (currently amended) A transformed, plant containing within its genome the 
nucleotide sequence of Claim 2- 1_. 
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6. (original) A transformed plant containing within its genome the nucleotide 
sequence of Claim 3. 

7. (currently amended) A-transfarmed The p lant of claim 5, wherein said plant is a 
monocot eantainm^ 

8. (currently amended) A stably transformed plant having inserted into its genome a 
nucleot ide sequence which encodes e hiiHerie-Rene-said^ene-6Hee6Hv>g an engineered protein 
comprising an amino acid s eq uence which diff ers from the amino add_seci fience of a native 
protein b y at least, one essential a mino acid residue h aYir^-altered^ 

wherein- said-protein, wherein said engineered protein has an altered amin o acid coniix>djjcm_in 
comparison to said native p rot ein, wherein said altered am ino acid composi;^ 
increase in es sential amino acid content to at least about 5% and wherein said engineerejj.i3rotejri 
tgjgmsi thft reformation of the nat ive protein and t herefore binds to an-intemefeig-meleeHte at 
1r^j_r^t^^^ mono clonal antibody, antibod y fra^ient v jMTOtg^. or modifi ^JBlBl^ilLwhich 

hinds-with is cap able of binding to said a -eerresiaendm^ native protein, wherei n said na tive 
protein is V SPa or VSPft 

9. (currently amended) The plant of Claim 8, wherein said alta^Uwnino acid 
ehanges c^jbos^ essential amino acid 
content to at l east about 10% in-tfaa-preteiu. 

10. (currently amended) The plant of Claim [[9]] 8, wherein said altered amino acid. 
IH^li^it^ in crease in the content of an essential amino acid [[is]] selected from 
the group consisting of methionine, tryptophan, lysine, valine r phei^ 

#ieronina-and cysteine. 
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J I . (currently amended) The plant of Claim [[10]] 8, wherein said altered amino acid 
^Tjw^jjjnii comprises an increase in methionine conte n t to at lea st about 10% es&Biitial-aiBino 
aeid-te4nei«as6^ 

12. (currently amended) The plant of Claim [[10]] 8, wherein said altered amino add 
g^jgsjjjnn com pr ise s an incre ase i n methionine content to at. least about 20 % essential-ammo 
fl6id-are4neiB^ 

13. (canceled) 

14. (original) The plant of Claim 8, wherein said plant is a dicot. 

15. (original) The plant of Claim 8, wherein said plant is a monocot. 

1 6. (original) The plant of Claim 1 5, wherein said monocot is maize. 

17. (currently amended) The plant of Claim [[15]] 14, wherein said dicot is soybean. 

1 8. (currently amended) Seed A transformed seed of the plant of Claim 8. 

1 9. (currently amended) Seed A transformed seed o f the plant of Claim 1 5 . 

20. (currently amended) Seed A trans formed seed of tlie plant of Claim 1 6. 
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Applicants acknowledge with appreciation the rejoining of the claims of Groups I and IL 
Applicants have amended the specification as suggested in the Office Action; accordingly, the 
objection to tire specification should he withdrawn. 

Claims 1-20 are pending in the application. Claims 4 and 13 have been canceled. Claims 
1-3, 5, 7*12, and 17-20 have been amended. Support for the amendments can he found in the 
specification, particularly on pages 2, 4, 5, 7, 8, and 13 as well as in the original claims. No new 
matter has been added by way of amendment. Re-examination and reconsideration of the claims 
as amended are requested. 

The Reje ction of Claims Under 35 U.S.C.S1 12.. First Par agraph, 
Should Be Withdrawn 

The Office Action (6/17/03, page 2) has rejected claims 1-20 under 35 U.S.C. §1 12, first 
paragraph, as failing to comply with the written description and enablement requirements. 
Applicants respectfully traverse these rejections. 

Independent claims 1 and 8 (and therefore claims 2-3 and 5-7 which are dependent on or 
incorporate the limitations of claim 1 and claims 9-12 and 14-20 which are dependent on or 
incorporate the limitations of claim 8) have been amended for clari fication and to include 
additional limitations. Support for the amendments can be found in the specification, 
particularly on pages 2, 4, 5, 7 9 8, and 13 as well as in the original claims. Independent claims 1 
and 8 now include the limitation that the native protein is VSPo; or VSP/3. 

The present specification discloses the amino acid sequences of VSPa and VSPft the 

sequences of several mefhionine-enriched VSPj3 variants, and a nucleic acid molecule encoding 

an engineered protein (VSPjS-Met 10; see Figure 4). As noted in the Office Action (page 4): 

Applicant teaches proposed methionine enriched VSP/3 variants based on 
conserved amino acid residues within VSP homologues (pages 15-16 and 
Figure 2); positions of possible tolerated amino acid substitutions within 
VSP/3 (pages 16 -19); a strategy for isolating correctly folded methionine 
enriched variants of VSP/3 by testing for binding to a VSP/J speci fic 
antibody (pages 19-22) and methionine enriched variant VSP/3- [Met] 10 
binding to wild type VSPjS specific antibodies (page 1 9). 

RTA0l/2M059Kvl 
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Thus, the present specification provides exemplary nucleotide and amino acid sequences 
as well as guidance regarding evaluation of the functional limitations of the claims. Applicants 
believe that the teachings of the present specification, when considered by those of skill in the 
art, satisfy the written description and enablement requirements. 

Accordingly, Applicants respectfully submit that the rejections of claims under 35 U.S.C. 
§11 2, first paragr aph, should, be withdrawn. 

The Rejectio n of Claims Under 35 U.S.C .§1 12, Seco n d Para graph, 
Shoitkl Be W ithdrawn 

The Office Action (6/17/03, page 6) has rejected claims 1-20 under 35 U.S.C. §1 12, 
second paragraph, as being indefinite. Claim 1 was rejected for a variety of recitations and has 
been amended for clarification. Support for the amendments can be found throughout the 
specifications particularly on pages 4, 5, 7, 8, 13, and in the original claims. Claim 1 retains the 
phrase "capable of binding* 3 to indicate that the antibody, monoclonal antibody, antibody 
fragment, protein, or modified protein (see, e.g. 9 page 5 of the specification) binds to both the 
native protein and the engineered protein, but not at the same time. Applicants were concerned 
that if the term "binds" were used to describe both the binding of the engineered protein and the 
native protein, the claim might be interpreted to require simultaneous binding of both proteins. 
If this terminology is not acceptable, suggestions for acceptable terminology would be 
welcomed. Applicants also note that the term "essential amino acid" is discussed in the 
specification, for example, on page 7. Applicants respectfully believe that in view of the 
amendments to claim 1, this rejection should be withdrawn. 

Claim 2 was rejected for lack of comparative basis. As this claim has been amended lor 
clarification and is dependent on claim 1 which has also been amended for clarification, 
Applicants believe that this rejection should be withdrawn. 

Similarly, claim 3 has been amended for clarification and claims 4 and 13 have been 
amended to clarify that the particular vegetative storage protein of the claim is VSPaor VSP/3, as 
discussed throughout the specification. Claim 8 has been amended for clarification. Support for 
these amendments can be found throughout the specification, particularly on pages 4, 5, 7, 8, 13, 
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and in the original claims. Claims 9, 11, 12, and 13, which are dependent on amended 
independent claim 8, have also been amended for clarification. Support for these amendments 
can be found throughout the specification, particularly on page 7, and in the original claims. 
Claim 17 has been amended to correct dependency, and claims 1 8-20 have been amended to add 
an article. In view of the above amendments and comments, Applicants respectfully submit that 
the rejections of claims under 35 U.S.C. § 1 12, second paragraph, should be withdrawn. 

The Reject ion of .Claim s Under 35 U.S.C . 610 1 Should Be Withdraw n 
The Office Action (6/17/03, page 8) rejected claims 18-20 under 35 U.S.C. §101 as 
lacking utility because "[t]he seeds of Claims 18-20 encompass untransformed seeds; 5 These 
claims have been amended to require that the seeds are transformed. Accordingly, this rejection 
has been obviated by amendment and should be withdrawn. 

The Reject ion of Claims Under 35 U.S.C, 6102flrt Should Be Withd rawn 

The Office Action (6/17/03, page 9) has rejected claims 1-14, 17, and 18 under 35 U.S.C. 
§ 102(b) as anticipated by Jung et al WO 97/35023, published September 25, 1997. 

Applicants respectfully traverse this rejection. Applicants believe that the reference is 
not citable against the present application under 35 U.S.C. § 102(b). The Jung reference has an 
international publication date of September 25, 1997, whereas the present application has a 
priority date of December 10, 1997, less than one year la ter. Accordingly, the Jung re ference 
cannot be cited against the present application under 35 U.S.C. § 1 02(b). 

Independent claims 1 and 8 (and therefore claims 2-3 and 5-7 which are dependent on or 
incorporate the limitations of claim 1 and claims 942 and 14-20 which are dependent on or 
incorporate the limitations of claim 8) have been amended to include additional limitations. 
Particularly, independent claims 1 avid 8 now include the limitation that the native protein is 
VSPoior VSP/3. The Jung reference does not teach or suggest this limitation and therefore 
cannot anticipate the subject matter of the claims. 
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Further, Applicants note that the assignee on the Jung reference is the same assignee as 
the one on the present application and therefore the Jung reference is not available as a reference 
against the present claims under 35 U.S.C. § 103(c). 

In view of the above amendments and arguments, Applicants respectfully submit that the 
rejection of the claims under 35 U.S.C. § 102(b) should be withdrawn. 

The Office Action (6/17/03, page 10) has rejected claims 1-14, 17, and 18 under 35 
U.S.C. § 102(e) as anticipated by Tarczynski ei ah U.S. Pat. No. 6,080,913, filed September 25, 
1996. 

Independent claims 1 and 8 (and therefore claims 2-3 and 5»7 which ar e dependent on or 
incorporate the li mitations of claim 1 and claims 9-12 and 14-20 which are dependent on or 
incorporate the limitations of claim 8) have been amended to include additional limitations. 
Particularly, independent claims 1 and 8 now include the limitation that the engineered protein 
retains the conformation of the native protein, which is VSPaor VSP0. The Tarczynski 
reference does not teach or suggest this limitation and therefore cannot anticipate tire subject 
matter of the claims. 

Further, Applicants note that the assignee on the Tarczynski reference is the same 
assignee as the one on the present application and therefore the Tarczynski reference is not 
available as a reference against the present claims under 35 U.S.C. §103(c). 

In view of the above amendments and arguments, Applicants respectfully submit that the 
rejection of the claims under 35 U.S.C. § 102(e) should be withdrawn. 

The Rejection of Claims Under 35 U.S.C. 5103 Should Be Wi thdra wn 

The Office Action (6/17/03, page 1 1) has rejected Claims 1-20 under 35 U.S.C. § 103(a) 
over the Jung reference in view of Gordon-Kamrn ei ah (1990) The Plant Celt 2: 603-608. The 
^Office Action states that "[t]he claims are drawn to soybean and maize plants transformed with a 
nucleic acid molecule encoding a vegetative storage protein. . 

KTA01/2M0598vl 



PACE 13/22 * RCVD AT 12/18/2003 11:53:19 AM [Eastern Standard Time] * SVR:USPT -EFXRF-1/0* DNIS:8729306* CSID:91 98622260 * DURATI N <mm-ss):18-14 



DEC. - 1 8' 03 (THU) 11=55 



TEL = 9 198622260 



P. 0 1 4 



Ap.pL No.: 09/478,567 
Filed: January 6, 2000 
Page 12 

Applicants respectfully traverse this rejection. Applicants note that claims 1-4 are not 
drawn to transformed plants and therefore should apparently not have been included in this 
rejection. In addition, the claims that are drawn to transformed plants are not: limited to maize 
and soybean, as stated in the Office Action. Applicants also note that the Jung reference has the 
same assignee as the present application and therefore the Jung reference is not available to be 
cited against the present application under 35 U.S.C. § J 03(c). 

Accordingly, Applicants respectfully submit that the rejection of claims under 35 U.S.C. 
§ 103(c) should be withdrawn. 

Consideration Of Previously Submitted Information Disclosure Stateme nt 

It is noted that an initialed copy of the PTO Form 1449 that was submitted with 
Applicants 7 Information Disclosure Statement filed January 6, 2000, has not been returned to 
Applicants' representative with the Office Action. Accordingly, it is requested that an initialed 
copy of the Form 1449 be forwarded to the undersigned with the next communication from the 
PTO. In order to facilitate review of the references by the Examiner, a copy of the Information 
Disclosure St atement and the Form 1449 are attached hereto. Copies of the cited references were 
provided at the time of filling the original Information Disclosure Statement, arid, therefore, no 
additional copies of the references are submitted herewith. Applicants will be pleased to provide 
addi tional copies of the references upon the Examiner's request if it pro ves difficult to local e the 
original references. 

CONCLUSION 

In view of the above amendments and remarks, Applicants submit that the reject ions of 
the claims under 35 U.S.C. §§112, 101, 102, and 103 are overcome. Applicants respectfully 
submit that this application is now in condition for allowance. Early notice to this effect is 
solicited. 

If in the opinion of the Examiner, a telephone conference would expedite the prosecution 
of the subject Application, the Examiner is invited to call the undersigned. 
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It is not believed that extensions of time or fees for net add i ti on of claims are required, 
beyond those that may otherwise be provided for in documents accompanying this paper. 
However, in the event that additional extensions of time arc necessary to allow consideration of 
this paper, such extensions are hereby petitioned under 37 CFR § 1.136(a), and any fee required 
therefore (including fees for net addition of claims) is hereby authorized to be charged to Deposit 
Account No. 16-0605. 
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J hereby certify that tins correspondence is being deposited with the United States Postal Service with sufficient 
postage as first class; mail in an envelope addressed to: Mail Stop Non-Fee Amendment , Commissioner for Patents, 
P^OjjBox 1450, Alexandria, VA 22313-1450, on August 14, 2003 
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submitting the listed documents for the sake of Ml disclosure. 
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